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Preface 


Elementary number theory is concerned with the arithmetic properties of 
the ring of integers, Z, and its field of fractions, the rational numbers, Q. 
Early on in the development of the subject it was noticed that Z has many 
properties in common with A = F(T], the ring of polynomials over a finite 
field. Both rings are principal ideal domains, both have the property that 
the residue class ring of any non-zero ideal is finite, both rings have infinitely 
many prime elements, and both rings have finitely many units. Thus, one 
is led to suspect that many results which hold for Z have analogues of 
the ring A. This is indeed the case. The first four chapters of this book 
are devoted to illustrating this by presenting, for example, analogues of 
the little theorems of Fermat and Euler, Wilson’s theorem, quadratic (and 
higher) reciprocity, the prime number theorem, and Dirichlet’s theorem on 
primes in an arithmetic progression. All these results have been known for 
a long time, but it is hard to locate any exposition of them outside of the 
original papers. 

Algebraic number theory arises from elementary number theory by con- 
sidering finite algebraic extensions K of Q, which are called algebraic num- 
ber fields, and investigating properties of the ring of algebraic integers 
Ox C K, defined as the integral closure of Z in K. Similarly, we can con- 
sider k = F(T), the quotient field of A and finite algebraic extensions L of 
k. Fields of this type are called algebraic function fields. More precisely, an 
algebraic function fields with a finite constant field is called a global func- 
tion field. A global function field is the true analogue of algebraic number 
field and much of this book will be concerned with investigating proper- 
ties of global function fields. In Chapters 5 and 6, we will discuss function 
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fields over arbitrary constant fields and review (sometimes in detail) the 
basic theory up to and including the fundamental theorem of Riemann- 
Roch and its corollaries. This will serve as the basis for many of the later 
developments. 

It is important to point out that the theory of algebraic function fields 
is but another guise for the theory of algebraic curves. The point of view 
of this book will be very arithmetic. At every turn the emphasis will be 
on the analogy of algebaic function fields with algebraic number fields. 
Curves will be mentioned only in passing. However, the algebraic-geometric 
point of view is very powerful and we will freely borrow theorems about 
algebraic curves (and their Jacobian varieties) which, up to now, have no 
purely arithmetic proof. In some cases we will not give the proof, but will 
be content to state the result accurately and to draw from it the needed 
arithmetic consequences. 

This book is aimed primarily at graduate students who have had a good 
introductory course in abstract algebra covering, in addition to Galois the- 
ory, commutative algebra as presented, for example, in the classic text of 
Atiyah and MacDonald. In the interest of presenting some advanced re- 
sults in a relatively elementary text, we do not aspire to prove everything. 
However, we do prove most of the results that we present and hope to in- 
spire the reader to search out the proofs of those important results whose 
proof we omit. In addition to graduate students, we hope that this material 
will be of interest to many others who know some algebraic number the- 
ory and/or algebraic geometry and are curious about what number theory 
in function field is all about. Although the presentation is not primarily 
directed toward people with an interest in algebraic coding theory, much 
of what is discussed can serve as useful background for those wishing to 
pursue the arithmetic side of this topic. 

Now for a brief tour through the later chapters of the book. 

Chapter 7 covers the background leading up to the statement and proof 
of the Riemann-Hurwitz theorem. As an application we discuss and prove 
the analogue of the ABC conjecture in the function field context. This 
important result has many consequences and we present a few applications 
to diophantine problems over function fields. 

Chapter 8 gives the theory of constant field extensions, mostly under the 
assumption that the constant field is perfect. This is basic material which 
will be put to use repeatedly in later chapters. 

Chapter 9 is primarily devoted to the theory of finite Galois extensions 
and the theory of Artin and Hecke L-functions. Two versions of the very 
important T’chebatorov density theorem are presented: one using Dirichlet 
density and the other using natural density. Toward the end of the chapter 
there is a sketch of global class field theory which enables one, in the abelian 
case, to identify Artin [-series with Hecke L-series. 

Chapter 10 is devoted to the proof of a theorem of Bilharz (a studentof 
Hasse) which is the function field version of Artin’s famous conjecture on 
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primitive roots. This material, interesting in itself, illustrates the use of 
many of the results developed in the preceding chapters. 

Chapter 11 discusses the behavior of the class group under constant field 
extensions. It is this circle of ideas which led Iwasawa to develop “Iwasawa 
theory,” one of the most powerful tools of modern number theory. 

Chapters 12 and 13 provide an introduction to the theory of Drinfeld 
modules. Chapter 12 presents the theory of the Carlitz module, which was 
developed by L. Carlitz in the 1930s. Drinfeld’s papers, published in the 
1970s, contain a vast generalization of Carlitz’s work. Drinfeld’s work was 
directed toward a proof of the Langlands’ conjectures in function fields. 
Another consequence of the theory, worked out separately by Drinfeld and 
Hayes, is an explicit class field theory for global function fields. These chap- 
ters present the basic definitions and concepts, as well as the beginnings of 
the general theory. 

Chapter 14 presents preliminary material on S-units, S-class groups, and 
the corresponding L-functions. This leads up to the statement and proof of 
a special case of the Brumer-Stark conjecture in the function field context. 
This is the content of Chapter 15. The Brumer-Stark conjecture in function 
fields is now known in full generality. There are two proofs — one due to 
Tate and Deligne, another due to Hayes. It is the author’s hope that anyone 
who has read Chapters 14 and 15 will be inspired to go on to master one 
or both of the proofs of the general result. 

Chapter 16 presents function field analogues of the famous class number 
formulas of Kummer for cyclotomic number fields together with variations 
on this theme. Once again, most of this material has been generalized 
considerably and the material in this chapter, which has its own interest, 
can also serve as the background for further study. 

Finally, in Chapter 17 we discuss average value theorems in global fields. 
The material presented here generalizes work of Carlitz over the ring A = 
F(T]. A novel feature is a function field analogue of the Wiener-Ikehara 
Tauberian theorem. The beginning of the chapter discusses average values 
of elementary number-theoretic functions. The last part of the chapter deals 
with average values for class numbers of hyperelliptic function fields. 

In the effort to keep this book reasonably short, many topics which could 
have been included were left out. For example, chapters had been contem- 
plated on automorphisms and the inverse Galois problem, the number of 
rational points with applications to algebraic coding theory, and the theory 
of character sums. Thought had been given to a more extensive discussion 
of Drinfeld modules and the subject of explicit class field theory in global 
fields. Also omitted is any discussion of the fascinating subject of transcen- 
dental numbers in the function field context (for an excellent survey see J. 
Yu [1]). Clearly, number theory in function fields is a vast subject. It is of 
interest for its own sake and because it has so often served as a stimulous to 
research in algebraic number theory and arithmetic geometry. We hope this 
book will arouse in the reader a desire to learn more and explore further. 
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Polynomials over Finite Fields 


In all that follows F will denote a finite field with g elements. The model for 
such a field is Z/pZ, where p is a prime number. This field has p elements. 
In general the number of elements in a finite field is a power of a prime, 
q= pl. Of course, p is the characteristic of F. 

Let A = F(T], the polynomial ring over F. A has many properties in 
common with the ring of integers Z. Both are principal ideal domains, both 
have a finite unit group, and both have the property that every residue class 
ring modulo a non-zero ideal has finitely many elements. We will verify all 
this shortly. The result is that many of the number theoretic questions we 
ask about Z have their analogues for A. We will explore these in some 
detail. 

Every element in A has the form f(T) = aoI” +a ,T"1' +--+ + Qn. 
If ao A 0 we say that f has degree n, notationally deg(f) = n. In this 
case we set sgn(f) = ao and call this element of F* the sign of f. Note 
the following very important properties of these functions. If f and g are 
non-zero polynomials we have 


deg(fg) = deg(f) + deg(g) and sgn(fg) = sgn(f)sgn(g). 


deg(f +g) < max(deg(f), deg(g)). 


In the second line, equality holds if deg(f) 4 deg(g). 

If sgn(f) = 1 we say that f is a monic polynomial. Monic polynomials 
play the role of positive integers. It is sometimes useful to define the sign of 
the zero polynomial to be 0 and its degree to be —oo. The above properties 
of degree then remain true without restriction. 
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Proposition 1.1. Let f,g € A with g # 0. Then there exist elements 
q,r € A such that f = qg+r andr is either 0 or deg(r) < deg(g). 
Moreover, q andr are uniquely determined by these conditions. 


Proof. Let n = deg(f), m = deg(g), a = sgn(f), @ = sgn(g). We give 
the proof by induction on n = deg(f). Ifn <_m, set q= 0 and r = f. If 
n> ™m, we note that f; = f —aG-!T"-™g has smaller degree than f. By 
induction, there exist q1,71 € Asuch that f; = qig+ri with r; being either 
0 or with degree less than deg(g). In this case, set gq = a8~-!T"-™ + q, and 
r =r, and we are done. 

If f =qg+r=q'g+r’, then g divides r—r’ and by degree considerations 
we see r = r’. In this case, gg = q’g so g = q’ and the uniqueness is 
established. 


This proposition shows that A is a Euclidean domain and thus a principal 
ideal domain and a unique factorization domain. It also allows a quick proof 
of the finiteness of the residue class rings. 


Proposition 1.2. Suppose g € A andg #0. Then A/gA is a finite ring 
with gt°8(9) elements. 


Proof. Let m = deg(g). By Proposition 1.1 one easily verifies that {r € 
A | deg(r) < m } is a complete set of representatives for A/gA. Such 
elements look like 


r= aot” } + a,T"~* +++ +Qm-1 witha; € F. 


Since the a; vary independently through F there are g™ such polynomials 
and the result follows. 


Definition. Let g € A. If g £0, set |g| = q9°8. If g = 0, set |g| = 0. 


|g| is a measure of the size of g. Note that if n is an ordinary integer, then 
its usual absolute value, |n|, is the number of elements in Z/nZ. Similarly, 
|g| is the number of elements in A/gA. It is immediate that |fg| = |f| |g| 
and |f +g| < max(|f|,|g|) with equality holding if |f| 4 |g|. 

It is a simple matter to determine the group of units in A, A*. If g 
is a unit, then there is an f such that fg = 1. Thus, 0 = deg(1) = 
deg(f) + deg(g) and so deg(f) = deg(g) = 0. The only units are the non- 
zero constants and each such constant is a unit. 


Proposition 1.3. The group of units in A is F*. In particular, it is a finite 
cyclic group with q —1 elements. 


Proof. The only thing left to prove is the cyclicity of F*. This follows from 
the very general fact that a finite subgroup of the multiplicative group of 
a field is cyclic. 


In what follows we will see that the number gq —1 often occurs where the 
number 2 occurs in ordinary number theory. This stems from the fact that 
the order of Z* is 2. | 


1. Polynomials over Finite Fields 3 


By definition, a non-constant polynomial f € A is irreducible if it cannot 
be written as a product of two polynomials, each of positive degree. Since 
every ideal in A is principal, we see that a polynomial is irreducible if and 
only if it is prime (for the definitions of divisibility, prime, irreducible, etc., 
see Ireland and Rosen |1]). These words will be used interchangeably. Every 
non-zero polynomial can be written uniquely as a non-zero constant times 
a monic polynomial. Thus, every ideal in A has a unique monic generator. 
This should be compared with the statement that evey non-zero ideal in Z 
has a unique positive generator. Finally, the unique factorization property 
in A can be sharpened to the following statement. Every f € A, f #0, can 
be written uniquely in the form 


f =aPO PS... Pe, 


where a € F* , each P; is a monic irreducible, P; 4 P; for 1 4 j, and each 
e; is a non-negative integer. 

The letter P will often be used for a monic irreducible polynomial in A. 
We use P instead of p since the latter letter is reserved for the characteristic 
of F. This is a bit awkward, but it is compensated for by being less likely 
to lead to confusion. 

The next order of business will be to investigate the structure of the 
rings A/fA and the unit groups (A/fA)*. A valuable tool is the Chinese 
Remainder Theorem. 


Proposition 1.4. Let m,,mo,...,m;, be elements of A which are pairwise 
relatively prime. Let m = mymo...m and ¢; be the natural homomor- 


phism from A/mA to A/m;A. Then, the map ¢: A/mA > A/m A ® 
A/mA®::-@A/mMA given by 


(a) = ($1 (a), b2(a), ve , $;(@)) 

1S a ring isomorphism. 
Proof. This is a standard result which holds in any principal ideal domain 
(properly formulated it holds in much greater generality). 
Corollary. The same map ¢ restricted to the units of A, A*, gives rise to 
a group isomorphism 

(A/mA)* ~ (A/m,A)* x (A/m2A)* x +++ x (A/m:A)*. 
Proof. This is a standard exercise. See Ireland and Rosen [1], Proposition 


3.4.1. 


Now, let f € A be non-zero and not a unit and suppose that f = 
aPy! Py? ... Pf is its prime decomposition. From the above considerations 
we have 


(A/ fA)" = (A/PytA)* x (A/ P37 A)* x ++ x (A/ Pe A)™. 
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This isomorphism reduces our task to that of determining the structure 
of the groups (A/P*°A)* where P is an irreducible polynomial and e is a 
positive integer. When e = 1 the situation is very similar to that is Z. 


Proposition 1.5. Let P € A be an irreducible polynomial. Then, (A/PA)* 
is a cyclic group with |P| —1 elements. 


Proof. Since A is a principal ideal domain, PA is a maximal ideal and so 
A/PA is a field. A finite subgroup of the multiplicative group of a field is 
cyclic. Thus (A/PA)* is cyclic. That the order of this group is |P| — 1 is 
immediate. 


We now consider the situation when e > 1. Here we encounter something 
which is quite different in A from the situation in Z. If p is an odd prime 
number in Z then it is a standard result that (Z/p°Z)* is cyclic for all 
positive integers e. If p= 2 and e > 3 then (Z/2°Z)* is the direct product 
of a cyclic group of order 2 and a cyclic group of order 2°~?. The situation 
is very different in A. 


Proposition 1.6. Let P € A be an irreducible polynomial and e a positive 
integer. The order of (A/P*A)* is |P|¢-1(|P| — 1). Let (A/P*A)™ be the 
kernel of the natural map from (A/P°A)* to (A/PA)*. It ts a p-group of 
order |P|\¢-!. As e tends to infinity, the minimal number of generators of 


(A/P°A) tends to infinity. 


Proof. The ring A/P*A has only one maximal ideal PA/P°A which has 
|P|°—! elements. Thus, (A/P*A)* = A/P®A—PA/P®*A has |P|*°—|P|¢-! = 
|P|¢-+(|P| — 1) number of elements. Since (A/P®A)* + (A/PA)* is onto, 
and the latter group has |P| — 1 elements the assertion about the size of 
(A/P*A)") follows. It remains to prove the assertion about the minimal 
number of generators. 


It is instructive to first consider the case e = 2. Every element in 
(A/P?.A)“) can be represented by a polynomial of the form a = 1 + bP. 
Since we are working in characteristic p we have a? = 1+ 6?P?P = 1 


(mod P*). So, we have a group of order |P| with exponent p. If q = p! it 
follows that (A/P?A)®) is a direct sum of f deg(P) number of copies of 
Z/p&Z. This is a cyclic group only under the very restrictive conditions that 
q = p and deg(P) = 1. 

To deal with the general case, suppose that s is the smallest integer such 
that p® > e. Since (1+ DP)?” = 1+ (bP)? = 1 (mod P®) we have that 
raising to the p’-power annihilates G = (A/P°A)"). Let d be the minimal 
number of generators of this group. It follows that there is an onto map 
from (Z/p*Z)* onto G. Thus, p** > pf dee)(e-) and so 


d> fdeg(P)le- 1) 


Since s is the smallest integer bigger than or equal to log, (e) it is clear that 
d—- oo as e€—-> Oo. 
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It is possible to do a much closer analysis of the structure of these groups, 
but it is not necessary to do so now. The fact that these groups get very 
complicated does cause problems in the more advanced parts of the theory. 

We have developed more than enough material to enable us to give in- 
teresting analogues of the Euler ¢-function and the little theorems of Euler 
and Fermat. 

To begin with, let f € A be a non-zero polynomial. Define ®(f) to 
be the number of elements in the group (A/fA)*. We can give another 
characterization of this number which makes the relation to the Euler ¢- 
function even more evident. We have seen that {r € A | deg(r) < deg(f)} 
is a set of representatives for A/fA. Such an r represents a unit in A/fA if 
and only if it is relatively prime to f. Thus ®(f) is the number of non-zero 
polynomials of degree less than deg(f) and relatively prime to f. 


Proposition 1.7. 
1 
6(f) =|fl [[a- P| 
P\f 
Proof. Let f = aP;*P;?...Py* be the prime decomposition of f. By the 
corollary to Propositions 1.4 and by Proposition 1.6, we see that 


t t 


of) = Tar) = Tar, 


i=1 1i=1 


€, |P; ex 1) 


from which the result follows immediately. 


The similarity of the formula in this proposition to the classical formula 
for $(n) is striking. 


Proposition 1.8. If f ¢ A, f #0, andaé A is relatively prime to f, i.e., 
(a, f) =1, then 
a®) =1 (mod f). 


Proof. The group (A/fA)* has ®(f) elements. The coset of a modulo f, a, 
lies in this group. Thus, a?) = I and this is equivalent to the congruence 
in the proposition. 


Corollary. Let P € A be irreducible anda € A be a polynomial not divisible 
by P. Then, 
alPI-1 = 1° (mod P). 


Proof. Since P is irreducible, it is relatively prime to a if and only if it 
does not divide a. The corollary follows from the proposition and the fact 
that for an irreducible P, ®(P) = |P| — 1 (Proposition 1.5). 


It is clear that Proposition 1.8 and its corollary are direct analogues of 
Euler’s little theorem and Fermat’s little theorem. They play the same very 
important role in this context as they do in elementary number theory. By 
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way of illustration we proceed to the analogue of Wilson’s theorem. Recall 
that this states that (p — 1)! = —1 (mod p) where p is a prime number. 


Proposition 1.9. Let P € A be irreducible of degree d. Suppose X is an 
indeterminate. Then, 


xlPrt_y= [|] (X-f) (mod P). 


O<deg(f)<d 


Proof. Recall that {f € A | deg(f) < d} is a set of representatives for the 
cosets of A/PA. If we throw out f = 0 we get a set of representatives for 


(A/PA)*. We find 


xW'—-t=  [[ (X-A, 


O<deg(f)<d 


where the bars denote cosets modulo P. This follows from the corollary to 
Proposition 1.8 since both sides of the equation are monic polynomials in 
X with the same set of roots in the field A/PA. Since there are |P| — 1 
roots and the difference of the two sides has degree less than |P| — 1, the 
difference of the two sides must be 0. The congruence in the Proposition is 
equivalent to this assertion. 


Corollary 1. Let d divide |P| — 1. The congruence X* = 1 (mod P) 
has exactly d solutions. Equivalently, the equation X¢ = 1 has exactly d 
solutions in (A/PA)*. 


Proof. We prove the second assertion. Since d | |P| — 1 it follows that 
X¢ —1 divides X'P|-1 —1. By the proposition, the latter polynomial splits 
as a product of distinct linear factors. Thus so does the former polynomial. 
This establishes the result. 


Corollary 2. With the same notation, 


lI f= -1 (mod P), 


O<deg(f)<deg P 


Proof. Just set X = 0 in the proposition. If the characteristic of F is odd 
|P| — 1 is even and the result follows. If the characteristic is 2 then the 
result also follows since in characteristic 2 we have —1 = 1. 


The above corollary is the polynomial version of Wilson’s theorem. It’s 
interesting to note that the left-hand side of the congruence only depends 
on the degree of P and not on P itself. 

As a final topic in this chapter we give some of the theory of d-th power 
residues. This will be of importance in Chapter 3 when we discuss quadratic 
reciprocity and more general reciprocity laws for A. 
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If f € A is of positive degree and a € A is relatively prime to f, we say 
that a is a d-th power residue modulo f if the equation 4 = a (mod f) is 
solvable in A. Equivalently, a is a d-th power in (A/fA)*. 

Suppose f = aP;1P;*...Py* is the prime decomposition of f. Then it 
is easy to check that a is a d-th power residue modulo f if and only if a 
is a d-th power residue modulo P,* for all 1 between 1 and t. This reduces 
the problem to the case where the modulus is a prime power. 


Proposition 1.10. Let P be irreducible and a € A not divisible by P. 
Assume d divides |P|—1. The congruence X“ =a (mod P®) is solvable if 
and only if 


at =1 (mod P). 


There are a) 


d-th power residues modulo P®. 


Proof. Assume to begin with that e = 1. 

If b¢ = a (mod P), then qm = plPl-l=1 (mod P) by the corollary 
to Proposition 1.8. This shows the condition is necessary. To show it is 
sufficient recall that by Corollary 1 to Proposition 1.9 all the d-th roots of 
unity are in the field A/PA. Consider the homomorphism from (A/P.A)* 
to itself given by raising to the d-th power. It’s kernel has order d and its 


image is the d-th powers. Thus, there are precisely P=} d-th powers in 


(A/PA)*. We have seen that they all satisfy X “Fr -1=0. Thus, they 
are precisely the roots of this equation. This proves all assertions in the 
case e = 1. 

To deal with the remaining cases, we employ a little group theory. The 
natural map (i.e., reduction modulo P) is a homomorphism from (A/P*®A)* 
onto (A/PA)* and the kernel is a p-group as follows from Proposition 
1.6. Since the order of (A/PA)* is |P| — 1 which is prime to p it follows 
that (A/P*A)* is the direct product of a p-group and a copy of (A/PA)*. 
Since (d,p) = 1, raising to the d-th power in an abelian p-group is an 
automorphism. Thus, a € A is a d-th power modulo P® if and only if it 
is a d-th power modulo P. The latter has been shown to hold if and only 
faa =1 (mod P). Now consider the homomorphism from (A/P®A)* 
to itself given by raising to the d-th power. It easily follows from what has 
been said that the kernel has d elements and the image is the subgroup of 
d-th powers. It follows that the latter group has order oP ~). This concludes 
the proof. 


Exercises 
1. {me A=FIT], and deg(m) > 0, show that g—1 | ®(m). 


2. If q = p is a prime number and P € A is an irreducible, show 
(F[T']/P?A)* is cyclic if and only if deg P = 1. 


10. 


11. 


12. 


. Show |] 
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. Suppose m € A is monic and that m = m,mz is a factorization into 


two monics which are relatively prime and of positive degree. Show 
(A/mA)* is not cyclic except possibly in the case gq = 2 and m, and 
my have relatively prime degrees. 


. Assume q # 2. Determine all m for which (A/mA)* is cyclic (see the 


proof of Proposition 1.6). 


. Suppose d | g — 1. Show x* = —1 (mod P) is solvable if and only if 


(—1)at dee? = 


acre & = —1. 


. Let P€ A be a monic irreducible. Show 


I] f=+1 (mod P), 
deg f<d 
f monic 


where d = deg P. Determine the sign on the right-hand side of this 
congruence. 


. For an integer m > 1 define [m] = T?” — T. Show that [m] is the 


product of all monic irreducible polynomials P(T’) such that deg P(T) 
divides m. 


. Working in the polynomial ring Fluo, wi,...,un], define D(uo, u1, 


..+,)Un) = det ut |, where 2,7 = 0,1,...,n”. This is called the Moore 
determinant. Show 


Tr 
D(uo,U1,-++,Un) = I] I] ve ] | 4 + ei-1ui-1 +++++ pug) . 
1=0c,_1EF co€F 


Hint: Show each factor on the right divides the determinant and then 
count degrees. 


Define F; = []J22j(T” —T%) = [229 [7 — i]®. Show that 
D(1,T,T’,...,T") =|] F;. 
j=0 


Hint: Use the fact that D(1,T,T?,...,7") can be viewed as a 
Vandermonde determinant. 


Show that Ff is the product of all monic polynomials in A of degee 
j. 


Define L; = 1 (Tr —-T)= 1 si). Use Exercise 8 to prove that 
L,; is the least common multiple of all monics of degree 7. 


13. 


14. 


15. 


16. 
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Show DLT. T? TA1, y) 
1J,i*,...,£° -,u 
IT @+f= DLT,T,...,.TH!) 


deg f<d 
Deduce from Exercise 13 that 


I] >a ‘aie | 


deg f<d 


Show that the product of all the non-zero polynomials of degree less 
than d is equal to (—1)¢4Fy/Lq . 


Prove that 


d 
hf, UV _ _1)\3 La q? 
« II ( i) XY F,L%. — 


deg f<d 4=0 


In the product the term corresponding to f = 0 is omitted. 


2 


Primes, Arithmetic Functions, 
and the Zeta Function 


In this chapter we will discuss properties of primes and prime decomposition 
in the ring A = F/T]. Much of this discussion will be facilitated by the use 
of the zeta function associated to A. This zeta function is an analogue of 
the classical zeta function which was first introduced by L. Euler and whose 
study was immeasurably enriched by the contributions of B. Riemann. In 
the case of polynomial rings the zeta function is a much simpler object and 
its use rapidly leads to a sharp version of the prime number theorem for 
polynomials without the need for any complicated analytic investigations. 
Later we will see that this situation is a bit deceptive. When we investigate 
arithmetic in more general function fields than F(T’), the corresponding 
zeta function will turn out to be a much more subtle invariant. 


Definition. The zeta function of A, denoted ¢,4(s), is defined by the infinite 


serles 
Ca(s) = 3 7 ’ 


fEAa 
f monic 


There are exactly gq? monic polynomials of degree d in A, so one has 


2 
So iflt=1t+ 5444-45, 
deg(f)<d q 4 d 


and consequently 
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for all complex numbers s with #(s) > 1. In the classical case of the Rie- 
mann zeta function, ¢(s) = S°>,n7*, it is easy to show the defining 
series converges for R(s) > 1, but it is more difficult to provide an analytic 
continuation. Riemann showed that it can be analytically continued to a 
meromorphic function on the whole complex plane with the only pole be- 
ing a simple pole of residue 1 at s = 1. Moreover, if ['(s) is the classical 
gamma function and €(s) = 7~2I°($)¢(s), Riemann showed the functional 
equation €(1 — s) = €(s). What can be said about ¢,4(s)? 

By Equation 1 above, we see clearly that ¢4(s), which is initially defined 
for R(s) > 1, can be continued to a meromorphic function on the whole 
complex plane with a simple pole at s = 1. A simple computation shows 
that the residue at s = 1 is DEG)" Now define £4(s) = q7*(1—q78)71¢,(s). 
It is easy to check that €4(1—s) = €,(s) so that a functional equation holds 
in this situation as well. As opposed to case of the classical zeta-function, 
the proofs are very easy for ¢,4(s). Later we will consider generalizations of 
Ca(s) in the context of function fields over finite fields. Similar statements 
will hold, but the proofs will be more difficult and will be based on the 
Riemann-Roch theorem for algebraic curves. 

Euler noted that the unique decomposition of integers into products of 
primes leads to the following identity for the Riemann zeta-function: 


p prime 
p>O 


This is valid for R(s) > 1. The exact same reasoning (which we won’t 
repeat here) leads to the following identity: 


ft —1 
mals) - P Ho - PP | 0) 


P monic 


This is also valid for all R(s) > 1. 

One can immediately put Equation 2 to use. Suppose there were only 
finitely many irreducible polynomials in A. The right-hand side of the equa- 
tion would then be defined at s = 1 and even have a non-zero value there. 
On the other hand, the left hand side has a pole at s = 1. This shows there 
are infinitely many irreducibles in A. One doesn’t need the zeta-function 
to show this. Euclid’s proof that there are infinitely many prime integers 
works equally well in polynomial rings. Suppose S is a finite set of irre- 
ducibles. Multiply the elements of S together and add one. The result is 
a polynomial of positive degree not divisible by any element of S. Thus, 
S cannot contain all irreducible polynomials. It follows, once more, that 
there are infinitely many irreducibles. 

Let x be areal number and 7(z) be the number of positive prime numbers 
less than or equal to xz. The classical prime number theorem states that 
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m(x) is asymptotic to x/log(x). Let d be a positive integer and x = q?. We 
will show that the number of monic irreducibles P such that |P| = z is 
asymptotic to x/log,(x) which is clearly in the spirit of the classical result. 

Define ag to be the number of monic irreducibles of degree d. Then, from 
Equation 2 we find 


¢a(s) = ] [GQ -¢%)-%. 
d=1 


If we recall that ¢4(s) = 1/(1 — q'~S) and substitute u = q7* (note that 
|u| < 1 if and only if R(s) > 1) we obtain the identity 


Taking the logarithmic derivative of both sides and multiplying the result 
by w yields 


qu => ee ut 
l-qu &~ 1— ut 


Finally, expand both sides into power series using the geometric series and 
compare coefficients of uw”. The result is the beautiful formula, 


S— dag = q’. 


d|n 


Proposition 2.1. 


This formula is often attributed to Richard Dedekind. It is interesting to 
note that it appears, with essentially the above proof, in a manuscript of 
C.F. Gauss (unpublished in his lifetime), “Die Lehre von den Resten.” See 
Gauss [1], pages 608-611. 


Corollary 


am == So uldat (3) 


d\n 


Proof. This formula follows by applying the Mobius inversion formula to 
the formula given in the proposition. 


The formula in the above proposition can also be proven by means of 
the algebraic theory of finite fields. In fact, most books on abstract alge- 
bra contain the formula and the purely algebraic proof. The zeta-function 
approach has the advantage that the same method can be used to prove 
many other things as we shall see in this and later chapters. 

The next task is to write a, in a way which makes it easy to see how big 
it is. In Equation 3 the highest power of g that occurs is gq” and the next 
highest power that may occur is g? (this occurs if and only if 2|n. All the 
other terms have the form +qg™ where m < }. The total number of terms is 
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>Ja\n |4(4)|, which is easily seen to be 2°, where ¢ is the number of distinct 
prime divisors of n. Let pi, po,...,pe be the distinct primes dividing n. 
Then, 2° < pypo...p; <n. Thus, we have the following estimate: 


Using the standard big O notation, we have proved the following theorem. 


Theorem 2.2. (The prime number theorem for polynomials) Let a, denote 
the number of monic irreducible polynomials in A = F(T] of degree n. Then, 


m= E+0(4), 
n n 


Note that if we set x = q” the right-hand side of this equation is 
t/log,(x) + O(./x/log,(z)) which looks like the conjectured precise form 
of the classical prime number theorem. This is still not proven. It depends 
on the truth of the Riemann hypothesis (which will be discussed later). 

We now show how to use the zeta function for other counting problems. 
What is the number of square-free monics of degree n? Let this number be 
b,,. Consider the product 


[0+ pp =D Ge 4 


P 


As usual, the product is over all monic irreducibles P and the sum is over 
all monics f. We will maintain this convention unless otherwise stated. 
The function 6(f) is 1 when f is square-free, and 0 otherwise. This is 
an easy consequence of unique factorization in A and the definition of 
square-free. Making the substitution u = q~* once again, the right-hand 
side of Equation 4 becomes }>*_,b,u™. Consider the identity 1+ w = 
(1 — w*)/(1 — w). If we substitute w = |P|~* and then take the product 
over all monic irreducibles P, we see that the left-hand side of Equation 4 
is equal to ¢4(s)/C4(2s) = (1 — g'~*8)/(1 — q'~8). Putting everything in 
terms of wu leads to the identity 


— = Doha 


Finally, expand the left-hand side in a geometric series and compare the 
coefficients of u” on both sides. We have proven— 


Proposition 2.3. Let b, be the number of square-free monics in A of 
degree n. Then b} =q and forn > 1, by =q"(1—q7?). 


It is amusing to compare this result with what is known to be true in 
Z. If B, is the number of positive square-free integers less than or equal 
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to n, then limyn-3oo Bn /n = 6/7?. In less precise language, the probability 
that a positive integer is square-free is 6/7*. The probablity that a monic 
polynomial of degree n is square-free is b,/q”, and this equals (1 — q~') 
for n > 1. Thus the probabilty that a monic polynomial in A is square- 
free is (1 — q~'). Now, 6/n? = 1/€(2), so it is interesting to note that 
(1 —q~!) = 1/¢,(2). This is, of course, no accident and one can give good 
heuristic reasons why this should occur. The interested reader may want 
to find these reasons and to investigate the probablity that a polynomial 
be cube-free, fourth-power-free, etc. 

Our next goal is to introduce analogues of some well-known number- 
theoretic functions and to discuss their properties. We have already in- 
troduced ®(f). Let u(f) be 0 if f is not square-free, and (—1)’ if f is a 
constant times a product of ¢ distinct monic irreducibles. This is the poly- 
nomial version of the Mobius function. Let d(f) be the number of monic 
divisors of f and o(f) = )/,);|9| where the sum is over all monic divisors 
of f. 

These functions, like their classical counterparts, have the property of 
being multiplicative. More precisely, a complex valued function A on A—{0} 
is called multiplicative if A( fg) = A(f)A(g) whenever f and g are relatively 
prime. We assume A is 1 on F*. Let 


f =aPy'P,;?... Py 
be the prime decomposition of f. If \ is multiplicative, 
Af) = MPH )APS?).. APH). 


Thus, a multiplicative function is completely determined by its values on 
prime powers. Using multiplicativity, one can derive the following formulas 
for these functions. 


Proposition 2.4. Let the prime decomposition of f be given as above. 


Then, 
of) = |fl[[@- Pi}, 
P\f 
d(f) = (e1+1)(e2+1)...(e,+1). 
Piatt —1 |Pojett—1 |[Pjett—4 
o(f) = 


[Pi] -1 PJ-1 [PJ -1 


Proof. The formula for ®(n) has already been given in Proposition 1.7. 
If P is a monic irreducible, the only monic divisors of P*® are 1, P, 
P?,...,P® so d(P*) =e+1 and the second formula follows. 
By the above paragraph, o(P*) = 1+ |P| + |P|? +...|P\® = 
(|P|¢t! —1/(|P| —1), and the formula for o(f) also follows. 


As a final topic in this chapter we shall introduce the notion of the 
average values in the context of polynomials. Suppose h(x) is a complex- 
valued function on N, the set of positive integers. Suppose the following 
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limit exists 


We then define a to be the average value of the function h. For example, 
suppose h(n) = 1 if n is square-free and 0 otherwise. Then, as noted above, 
the average value of h is known to be 6/7”. The sum 5-;_, A(k) sometimes 
grows too fast for the average value to exist. Often though, one can show 
the growth is dominated by a simple function of n. An example of this is 
the Euler ¢-function. One can show 


7m 


S- d(k) = Sn + O(nlog(n)). 


k=1 


For this and other results of a similar nature, see Chapter VIII of the classic 
book by G.H. Hardy and E.M. Wright, Hardy and Wright [1]. Another good 
reference for this material is Chapter 3 of Apostol [1]. 

In the ring A the analogue of the positive integers is the set of monic 
polynomials. Let h(x) be a function on the set of monic polynomials. For 
n > 0 we define 1 

Ave, (h) = qn S_ Af). 

f monic 
deg(f)=n 
This is clearly the average value of h on the set of monic polynomials of 
degree n. We define the average value of h to be limn_,o, Ave, (h) provided 
this limit exists. This is the natural way in which average values arise in 
the context of polynomials. It is an exercise to show that if the average 
value exists in the sense just given, then it is also equal to the following 
limit: 1 


f monic 
deg(f)<n 
As we pointed out above, this limit does not always exist. However, even 
when it doesn’t exist, one can speak of the average rate of growth of h(f). 
Define H(n) to equal the sum of h(f) over all monic polynomials of degree 
n. As we will see, the function H(n) sometimes behaves in a quite regular 
manner even though the values h(f) vary erratically. 
Instead of approaching these problems directly we use the method of 
Carlitz which uses Dirichlet series. Given a function h as above, we define 
the associated Dirichlet series to be 


D,(s) _ 3 h(f) — > H(n) (5) 


\f\s qrs ° 


fmonic 


In what follows, we will work in a formal manner with these series. If one 
wants to worry about convergence, it is useful to remark that if |h(f)| = 
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O(|f|?), then D;(s) converges for #(s) > 14+ 8. The proof just uses the 
comparison test and the fact that ¢,4(s) converges for #(s) > 1. 

The right-hand side of 5 is simply )~°°_, H(n)u”, so the Dirichlet series 
in s becomes a power series in w whose coefficients are the averages H(n). 
To see how this is useful, recall the function d(f) which is the number of 
monic divisors of f. Let D(n) be the sum of d(f) over all monics of degree 
n (hopefully, this notation will not cause too much confusion). Then, 


Proposition 2.5. Da(s) = ¢a(s)? = (1—qu)~? . Consequently, D(n) = 


(n + 1)q”. 
Proof. 1 ' 
ca? = (2 ip) (2 gp) = 
ECE gp Ep 
how f f 


This proves the first assertion. To prove the second assertion, notice 
Oo 
Da(s) = )> D(n)u” = (1—qu)? . 
n=0 


It is easily seen that (1 — qu)~* = °°, (n + 1)q"u”. Thus, the second 
assertion follows by comparing the coefficients of u™ on both sides of this 
identity. 


A few remarks are in order. Notice that Ave,(d) = +1 so the average 
value of d(f) in the way we have defined it doesn’t exist. On average, the 
number of divisors of f grows with the degree. If we set x = q” then our 
result reads D(n) = xlog,(x) + x which resembles closely the analogous 
result for the integers }7,_, d(k) = xlog(x) + (2y — 1)z + O(,/az) (here 
y & .577216 is Euler’s constant). This formula is due to Dirichlet. It is 
a famous problem in elementary number theory to find the best possible 
error term. In the polynomial case, there is no error term! This is because 
of the very simple nature of the zeta function ¢4(s). Similar sums in the 
general function field context lead to more difficult problems. We shall have 
more to say in this direction in Chapter 17. 

It is an interesting fact that many multiplicative functions have corre- 
sponding Dirichlet series which can be simply expressed in terms of the 
zeta function. We have just seen this for d(f). More generallly, let h(f) be 
multiplicative. The multiplicativity of h(f) leads to the identity 


D,(s) = O(S ao | 


P k=0 
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As an example, consider the function pu(f). Since )>7"_9 wr = =1-|P\~°, 


we find D,,(s) = ¢a(s)~*. The same method would enable us to determine 
the Dirichlet series for ®(f) and o(f). However, we will follow a slightly 
different path to this goal. 

Let A and p be two complex valued functions on the monic polynomials. 
We define their Dirichlet product by the following formula (all polynomials 
involved are assumed to be monic) 


This definition is exactly similar to the corresponding notion in elemen- 
tary number theory. As is the case there, the Dirichlet product is closely 
related to multiplication of Dirichlet series. 


Proposition 2.6. 
D)(s)Dp(s) = Dyxp(s) - 
Proof. The calculation is just like that of Proposition 2.5. 
ACh) p(9) 
D S D Ss) = = 
a(8) Dols) = (0 ae) | a) 


ED Nee) ps = Pavol. 


hoe =f 
We now proceed to calculate the average value of @(f). We have seen that 


=|f\][a-|P*). 


P\f 


Define A(f) = | f|. A moment’s reflection shows that the right hand side of 
the above equation can be rewritten as ) 7/4), u(9)|f/9| = (u*A)(f). Thus, 
by Proposition 2.6 we find 


Da(s) = Dusr(s) = Du(s)Da(s) = Ca(s)*Ca(s — 1) - (6) 
Proposition 2.7. 
N° Of) =P"(1-q"'). 


deg f=n 
f monic 


Proof. Let A(n) be the left-hand side of the above equation. Then, with 
the usual transformation u = q~* , Equation 6 becomes 


a n 1—qu 
S— A(n)u = To g2a 
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Now, expand (1 — q?u)~! into a power series using the geometric series, 
multiply out, and equate the coefficients of uw” on both sides. One finds 
A(n) = q*” — q*”"~!. The result follows. 


Finally, we want to do a similar analysis for the function o(f). Let 1(f) 
denote the function which is identically equal to 1 on all monics f. For any 
complex valued function \ on monics, we see immediately that (1*A)(f) = 


digit (9). In particular, if A(f) = |f|, then (1 * A)(f) = o(f). Thus, 


D,(s) = Disa(s) = Di(s)Da(s) = Ca(s)Ca(s — 1) . (7) 
Proposition 2.8. 


n—l 


S> off) = 


1—aqr- 1 
deg(fy=n q 


f monic 


Proof. Define S(n) to be the sum on the left hand side of the above 
equation. Then, making the substitution u = q~* in Equation 7 we find 


S° S(n)u™ = (1 —qu) (1 —q?u)7!. 


Expanding the two terms on the right using the geometric series, multiply- 
ing out, and collecting terms, we deduce 


S(n)= So ghd”. 


k+l=n 
The result follows after applying a little algebra. 


The method of obtaining average value results via the zeta function has 
now been amply demonstrated. The reader who wants to pursue this fur- 
ther can consult the original article of Carlitz [1]. Alternatively, it is an 
interesting exercise to look at Chapter VII of Hardy and Wright [1] or 
Chapter 3 of Apostol [1] , formulate the results given there for Z in the 
context of the polynomial ring A = F[T], and prove them by the methods 
developed above. 

In Chapter 17, we will return to the subject of average value results, but 
in the broader context of global function fields. 


Exercises 


1. Let f € A be a polynomial of degree at least m > 1. For each N > 
m show that the number of polynomials of degree N divisible by 
f divided by the number of polynomials of degree N is just |f|~1. 
Thus, it makes sense to say that the probability that an arbitrary 
polynomial is divisible by f is | f|~+. 
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10. 


11. 


12. 


13. 
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. Let Pi, Po,...,P € A be distinct monic irreducibles. Give a proba- 


bilistic argument that the probability that a polynomial not be divis- 
ible by any P? for 1 =1,2,...,t is give by T10 —|P,|~?). 


. Based on Exercise 2, give a heuristic argument to show that the prob- 


ability that a polynomial in A is square-free is given by ¢4(2)7?. 


. Generalize Exercise 3 to give a heuristic argument to show that the 


probability that a polynomial in A be k-th power free is given by 
Ca(k)~*. 


. Show 5~* |m|~! diverges, where the sum is over all monic polynomials 


MEA. 


. Use the fact that every monic m can be written uniquely in the form 


m = mom? where mo and m,; are monic and mp is square-free to 
show )~|mo|~! diverges where the sum is over all square-free monics 
Mog. 


. Use Exercise 6 to show 


I] @+lPl')7 0 a doo. 


P irreducible 
deg P<d 


. Use the obvious inequality 1+2 < e” and Exercise 7 to show 5*>|P|71 


diverges where the sum is over all monic irreducibles P € A. 


. Use Theorem 2.2 to give another proof that 5~|P|~! diverges. 


Suppose there were only finitely many monic irreducibles in A . De- 
note them by {P,, Po,...,P,}. Let m = P,P2...P, be their product. 
Show ®(m) = 1 and derive a contradiction. 


Suppose / is a complex valued function on monics in A and that the 
limit as n tends to infinity of Ave,(h) is equal to a. Show 


i — n\-1 _ 
Jim (1 + q+ + q”) p> h(f)=a. 
deg f<n 


Let pp(m) be the Mobius function on monic polynomials which we 
introduced in the text. Consider the sum ) J go. man U(™) over monic 
polynomials of degree n. Show the value of this sum is 1 if n = 0, —q 
ifn=1, and Oifn>1. 


For each integer k > 1 define o,(m) = do pm |f|*. Calculate Aven (cx). 


14. 


15, 


16. 
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Define A(m) to be log |P| ifm = P", a prime power, and zero other- 
wise. Show 

> ACA) = log |m| . 

f\|m 


Show that 
Dy(s) = ~Ca(s)/Ca(s). 
Use this to evaluate ) J ycg man A(™). 


Recall that d(m) is the number of monic divisors of m. Show 


Use this to evaluate So ace man d(m)’. 


3 
‘The Reciprocity Law 


Gauss called the quadratic reciprocity law “the golden theorem.” He was 
the first to give a valid proof of this theorem. In fact, he found nine differ- 
ent proofs. After this he worked on biquadratic reciprocity, obtaining the 
correct statement, but not finding a proof. The first to do so were Eisen- 
stein and Jacobi. The history of the general reciprocity law is long and 
complicated involving the creation of a good portion of algebraic number 
theory and class field theory. By contrast, it is possible to formulate and 
prove a very general reciprocity law for A = F(T] without introducing much 
machinery. Dedekind proved an analogue of the quadratic reciprocity law 
for A in the last century. Carlitz thought he was the first to prove the gen- 
eral reciprocity law for F[T’]. However O. Ore pointed out to him that F.K. 
Schmidt had already published the result, albeit in a somewhat obscure 
place (Erlanger Sitzungsberichte, Vol. 58-59, 1928). See Carlitz [2] for this 
remark and also for a number of references in which Carlitz gives different 
proofs the reciprocity law. We will present a particularly simple and elegant 
proof due to Carlitz. The only tools necessary will be a few results from 
the theory of finite fields. 


Let P € A be an irreducible polynomial and d a divisor of g — 1 (recall 
that q is the cardinality of F). If a € A and P does not divide a, then, by 
Proposition 1.10, we know x? =a (mod P) is solvable if and only if 


|P|—1 


a @ =1 (mod P). 


The left-hand side of this congruence is, in any case, an element of order 


dividing d in (A/PA)*. Since F* — (A/PA)* is one to one, there is a 
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unique a@ € F* such that 


a @ =a (mod P). 


Definition. If P does not divide a, let (a/P)q be the unique element of IF* 
such that Pia a 
q ¢ = (5), (mod P). 


If Pla define (a/P)q = 0. The symbol (a/P)q is called the d-th power 


residue symbol. 


When d = 2, this symbol is just like the Legendre symbol of elementary 
number theory. The situation is a bit more flexible in A since A* = E* is 
cyclic of order gq — 1, whereas Z* is just {+1}. Notice that the value of the 
residue symbol is in the finite field F and not in the complex numbers. 


Proposition 3.1. The d-th power residue symbol has the following prop- 
erties: 


1) (#),=(B), #a=6 (mod P). 


2) (?), ~ (2) (4), 
3) (4), = 1 iffx* =a (mod P) is solvable. 


4) Let ¢e E* be an element of order dividing d. There exists ana € A 
h that (3) =. 
such tha (+) ; 4 


Proof. The first assertion follows immediately from the definition. The sec- 
ond follows from the definition and the fact that if two constants are congru- 
ent modulo P then they are equal. The third assertion follows from the def- 
inition and Proposition 1.10. Finally, note that the map from (A/PA)* — 
F* given by a — (a/P)q is a homomorphism whose kernel is the d-th pow- 
ers in (A/PA)* by part 3. Since (A/PA)* is a cyclic group of order | P| —1, 
the order of the kernel is (|P| — 1)/d. Consequently, the image has order d 
and part 4 follows from this. 


It is an easy matter to evaluate the residue symbol on a constant. 


Proposition 3.2. Leta éF. Then, 


(g) <0 own 
d 


Proof. Let 6 = deg(P). Then, 


IP|-1  @- 
dd 
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The result now follows from the definition and the fact that for all a € F 
we have a? = a. 


Notice that if d|deg(P) every constant is automatically a d’th power 
residue modulo P. 
We are now in a position to state the reciprocity law. 


Theorem 3.3. (The d-th power reciprocity law) Let P and @ be monic 
irreducible polynomials of degrees 6 and v respectively. Then, 


(5),= 9"), 


Proof. Let’s define (a/P) = (a/P)g-1. Then (a/P)a = (a/P)*=. The 


theorem would follow in full generality if we could show 


Q _ bv =) 
(>) = (-1) (5 
since the general result would follow by raising both sides to the (¢ —1)/d 
power. 
Let a be a root of P and £ a root of Q. Let F’ be a finite field which 
contains F, a, and @. Using the theory of finite fields we find 


P(T) =(T —a)(T —a4)-.-(T-a®) and 
Q(L) = (L - p(T - 84) 7-6"). (1) 
We now take congruences in the ring A’ = F’|T]. Note that if f(T) € A’ 
we have f(T) = f(a) (mod (T — a)). Also note that if g(T) € A then 
g(T)? = g(T?) which follows readily from the fact that the coefficients of 
g(T) are in F. From this remark, and the definition, we compute that (Q/P) 
is congruent to 


Q(T) tate +P" = Q(T)Q(T4)--- Q(T”) 


Q(a)Q(a7)---Q(at ) (mod (1 — a)). 


By symmetry this congruence holds modulo (T— a ) for all i and it follows 
that it holds modulo P. Combining this result with Equation 1 yields the 
following congruence: 


6—lv—1 


(Z )= IT [[e* — BT) (mod P), 


t=0 j=0 
Both sides of this congruence are in F’ so they must be equal. Thus, 


6—lv—l 


(5) = HL Ther - 9") = 0" TE He" 0") = -9"(G). 


i=0 j=0 
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This concludes the proof. 


This beautiful proof is due to Carlitz. It is contained in a set of lecture 
notes for a course on polynomials over finite fields which he gave at Duke 
in the 1950s. We will outline another proof, also due to Carlitz, in the 
exercises to Chapter 12. 


As in the classical theory, it is convenient to extend the definition of the 
d-th power reciprocity symbol to the case where the prime P is replaced 
with an arbitrary non-zero element b € A. 


Definition. Let b € A, b # 0, and b = BQhQ!?...Qf be the prime 
decomposition of b. If a € A, define 


a - a \ fy 
*) -T](4)*. @) 
( b d jal ( Q; ), 

Notice that this definition ignores G = sgn(b) and so the symbol only 
depends on the principal ideal bA generated by b. The basic properties 
of this extended symbol are easily derived from those of the d-th power 
residue symbol. 


Proposition 3.4. The symbol (a/b)q has the following properties. 

1) Ifa, =a (mod Bb) then (a;/b)q = (a2/b)a. 

2) (aya2/b)a = (a1/b)a(a2/d)a. 

3) (a/byb2)a = (a/b1)a(a/b2)a. 

4) (a/b)q #0 iff (a,b) =1 (a is relatively prime to b). 

5) Ifx* =a (mod b) is solvable, then (a/b)g = 1, provided that (a, b) = 1. 


Proof. Properties 1 — 4 follow from the definition and the properties of the 
symbol (a/P)q. 

To show property 5, suppose c 
2, (a/b)a = (c4/b)a = (c/b)4 = 1. 


The converse of assertion 5 in Proposition 3.4 is not true in general. For 
example, suppose Q is a monic irreducible not dividing a and b = Q?. Then, 
by property 3 above we have (a/b)g = (a/Q*)4a = (a/Q)4 = 1. However, 
not every element of (4/bA)* = (A/Q‘A)* is a d-th power. In fact, the 
group of d-th powers has index d as we saw in Proposition 1.10. 

The same example shows that property 4 of Proposition 3.1 doesn’t hold 
for the generalized symbol. As a mapping from (A/Q?¢A)* — F* the symbol 
(a/Q*)a only takes on the value 1 and no other element of order divi- 
ding d. 

It is useful to have a form of the reciprocity law which works for arbitrary 
(i.e., not necessarily monic or irreducible) elements of A. For f €¢ A, f 40, 
define sgn,(f) to be the leading coefficient of f raised to the to power. 


¢ =a (mod b). Then, by properties 1 and 
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Theorem 3.5. (The general reciprocity law). Let a,b € A be relatively 
prime, non-zero elements. Then, 


a b -i qi a € é€ — aeg(a 
(ala). = (—1) “a 4e8(@) 480) son (a) #8) sen g(b)~ O°8(), 
Proof. When a and b are monic irreducibles this reduces to Theorem 3.3. In 
general, the proof proceeds by appealing to Proposition 3.2, Theorem 3.3, 
the definitions, and the fact that the degree of a product of two polynomials 
is equal to the sum of their degrees. We omit the details. 


The reciprocity law can be thought of as a pretty formula, but its im- 
portance lies in the fact that it relates two natural questions in an intrinsic 
way. Given a polynomial m of positive degree, what are the d-th powers 
modulo m? Since (A/mA)* is finite, one can answer this question in prin- 
ciple by just writing down the elements of (A/mA)*, raising them to the 
d-th power, and making a list of the results. The answer will be a list of 
cosets or residue classes modulo m. In practice this may be hard because 
of the amount of calculation involved. One can appeal to the structure of 
(A/mA)* to find shortcuts. Parenthetically, it is an interesting question to 
determine the number of d-th powers modulo m. Recall that we are as- 
suming d|(q—1). Under this assumption, the answer is ®(m)/ d\™) | where 
A(m) is the number of distinct monic prime divisors of m. This follows from 
Proposition 1.10 and the Chinese Remainder Theorem. 

Now, let’s turn things around somewhat. Given m, find all primes P such 
that m is a d-th power modulo P. It turns out that there are infinitely many 
such primes, so that it is not possible to answer the question by making a 
list. One has to characterize the primes with this property in some natural 
way. This is what the reciprocity law allows us to do. 

For simplicity, we will assume that m is monic. It is no loss of gener- 
ality to assume that all the primes we deal with are monic as well. Let 


{a 1,@2,...,a4} be coset representatives for the classes in (A/mA)* which 
have the property (a/m)g = 1. If there is a b € A such that (b/m)g = —1 
let {b;, bz,..., b¢} be coset representatives for all classes with this property. 


Proposition 3.6. With the above assumptions we have 


1) If deg(m) is even, (q¢ —1)/d is even, or p = char(F’) = 2, m ts a d-th 
power modulo P iff P =a, (mod m) for somei=1,2,...,. 

2) If deg(m) is odd, (¢ —1)/d is odd, and p = char(F) is odd, then m 
is a d-th power modulo P iff either deg(P) is even and P = a, (mod m) 
for some i = 1,2,...,¢ or deg(P) is odd and P = b; (mod m) for some 
1=1,2,...,t. 


Proof. By Theorem 3.5, we have 


(B),= career (2) 
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If any of the conditions in Part 1 hold, we have (m/P)q = (P/m)q and this 
gives the result by Part 3 of Proposition 3.1 and the fact that (P/m)q only 
depends on the residue class of P modulo m. 

If the conditions of Part 2 hold, then (m/P)q = (—1)9°8\)(P/m)q. Thus, 
if deg(P) is even, (m/P)q = 1 iff P = a; (mod m) for some i, and if deg(P) 
is odd, (m/P)q = 1 iff P = b; (mod m) for some 7. That there isabe A 
with (b/m)q = —1 under the conditions of Part 2 follows from the fact that 


(—) _ (—1) *a dea(m) ~_]. 
d 


A number of interesting number-theoretic questions are of the following 
form: if a certain property holds modulo all but finitely many primes, does 
it hold in A? One such property is that of being a d-th power. In this case 
the question has a positive answer. The key to the proof, as we shall see, 
is the reciprocity law. 


Theorem 3.7. Let m € A be a polynomial of positive degree. Let d be an 
integer dividing q —1. If xt = m (mod P) is solvable for all but finitely 
many primes P, then m = m4 for some m, € A. 


Proof. Let m = uQ)'Qs”...Q¢* be the prime decomposition of m. We 
begin by showing that if some e; is not divisible by d, then there are in- 
finitely many primes L such that (m/L)q 4 1. This will contradict the 
hypothesis and we can conclude that the hypothesis implies m = wm/@ for 
some mi, € A. 

We may as well assume that e, is not divisible by d. Let {Z,, L2...., Ls} 
be a set of primes not dividing m such that (m/L;)aqa 41 for j =1,2,...,s. 
For any a € A we have 


t 
a a \& 

Ce = Wg) (3) 

By Part 4 of Proposition 3.1, there exists an element c € A such that 
(c/Qi)a = Ca, a primitive dth root of 1. By the Chinese Remainder Theo- 
rem, we can find ana € A such that a =c (mod Q)) and a=1 (mod Q;) 
for 1 > 2, and a = 1 (mod L,) for all 7. Once such an a is chosen we can 
add to it any A-multiple of Q:Q....Q,L,L2...L, and it will satisfy the 
same congruences as a. ‘Thus we may assume, by choosing a suitable such 
multiple of large degree, that a is monic and of degree divisible by 2d. As- 
suming that a has these properties, we substitute it into Equation 3 and 


derive 
a 


(a). =a #1 


By the reciprocity law, 
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It follows that there must be a prime Lla such that (m/L)q # 1. Since 
a = 1 (mod L;) for every 7 we must have L 4 L, for all 7. This shows 
there must be infinitely many primes L such that (m/L)q 4 1 if e; is not 
divisible by d. The same assertion holds for each e;. 

We have shown that under the hypothesis of the theorem m = pm’, 
where p € F*. It remains to show that must be a d-th power. Consider 


(B= (B) ee om” ‘ 


By Theorem 2.2, there are infinitely many irreducibles of degree relatively 
prime to d. In fact, there are irreducibles of every degree. Thus there is 
an irreducible P of degree prime to d and such that (m/P)q = 1. It then 
follows from equation number (4) that pia deg P = 1 and so, wir = 1. 
This shows that py is a d-th power, p = y?%, in F. Set m, = om!, and we 
have m = m4, as asserted. 

In the statement and proof of Theorem 3.7 we have been assuming that 
d divides q —1. Is this necessary? The statement of the theorem is not true 
for all d. For example, consider p = char(F). For every prime P and any 
a € A we have that a is a p-th power modulo P. This follows from the fact 
that raising to the p-th power is an automorphism of the finite field A/PA. 
Thus, the theorem fails if d = p or indeed if d is a power of p. However, 


Fact. The assertion of Theorem 3.7 remains true if p does not divide d. 
In other words, if d is not divisible by p it is not necessary to assume that 
d|q—1. 


We will sketch a proof. We rely on Theorem 3.7 together with some 
elementary facts about finite fields. 

Since p does not divide d, q and d are relatively prime. Thus, there is a 
positive integer n such that q” = 1 (mod d). Let F’ be a field extension of 
F of degree n. F’* has gq” — 1 elements and so must contain a primitive d-th 
root of unity. Set A’ = F(T). 

Now, suppose that m € A and that m is a d-th power for all but finitely 
many primes P of A. If P’ is a prime of A’ it is easy to check that P’A/NA = 
PA where P is a prime of A. It follows that m is a d-th power modulo all 
but finitely many primes of A’. Invoking Theorem 3.7, we see that m = m4 
is a d-th power in A’. We need to show that m’ can be chosen to be in A. 

Let P be a prime of A and consider it as an element of A’. It factors as 
a product of primes in A’; P = P/P}.--P! where the P! are all distinct 
( over a finite field, every irreducible polynomial has no repeated roots in 
any algebraic extension ). For a prime P of A, let e be the highest power 
to which P divides m. If P’ is a prime of A’ dividing P, then e is also the 
highest power of P’ dividing m. Since m = m’@, unique factorization in A’ 
implies dle. This being true for all primes P of A, it follows that m = pm? 
with m, € A and uw € F. It remains to show that p is a d-th power in F*. 
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By the hypothesis on m and the equation m = pm we see that p is 
a d-th power for all but finitely many primes P. Let d’ = (d,q—1). p is 
a d’-th power for all but finitely many primes P (since d’\d). Moreover, 
d'\(q— 1). Using Theorem 3.7 once again, we see that ys is a d’-th power. 
Since F* is cyclic of order gq — 1 it is easy to see that F** = Ere Thus, pL 
is a d-th power, and we are done. 


Exercises 


1. 
2. 
3. 


Fill in the details of the proof of Proposition 3.4. 
Fill in the details of the proof of Theorem 3.5. 


Suppose d | q — 1 and that m € A is a polynomial of positive de- 
gree. Show that the number of d-th powers in (A/mA)* is given by 
®(m) /d*“™, where A(m) is the number of distinct monic prime divi- 
sors of m. 


. Let P € A be a prime and consider the congruence X* = —1 


(mod P). Show this congruence is solvable except in the case where 
q = 3 (mod 4) and deg P is odd. 


. Suppose d’ | q—1 and a € F* is an element of order d’. Let P € A be 


a prime of positive degree and suppose that d is a divisor of |P| — 1. 
Show that X* = a (mod P) is solvable if and only if dd’ divides 


|P| — 1. Show how Exercise 4 is a special case of this result. 


. Suppose that d is a positive integer and that gq = 1 (mod 4d). Let 


P € Abe a monic prime. Show that X¢ = T (mod P) if and only if 
the constant term of P, i.e. P(O), is a d-th power in F. 


. Suppose d divides g — 1 and that P € A is a prime. Show that the 


number of solutions to X¢ =a (mod P) is given by 


Om ae 


. Let b € A and suppose b = GP; Ps? .-- P’* is the prime decomposi- 


tion of b. Here, G € F* and the P; are distinct monic primes. Con- 
sider (a/b)q as a homomorphism from (A/bA)* to the cyclic group 
< Ca > generated by an element ¢y € F* of order d. Show that this 
map is onto if and only if the greatest common divisor of the set 
{e€1,€2,...,e¢} is relatively prime to d. 


Suppose d | qg—1 and a, bi, bz € A. Show that (a/b1)¢ = (a/b2)q if the 
following conditions hold: b; = bz (mod a), deg b; = deg bg (mod 4d), 
and sgnq(b1) = sgng(b2). 
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10. In this exercise we give an analogue of the classical Gauss criterion 
for the Legendre symbol. Let P € A be a prime. Show that every non- 
zero residue class modulo P has a unique representative of the form 
pum where p € F* and m is a monic polynomial of degree less than 
deg P. Let M denote the set of monics of degree less than deg P. 
Suppose a € A with P { a. For each m € M write am = pmm’ 
(mod P) where fm € F* and m’ € M. Show 


In the exercises to Chapter 12, we will use this criterion to outline another 
proof of the Reciprocity Law (also due to Carlitz). 


4 


Dirichlet L-Series and Primes 
in an Arithmetic Progression 


Our principal goal in this chapter will be to prove the analogue of Dirichlet’s 
famous theorem about primes in arithmetic progressions. This was first 
proved by H. Kornblum in his PhD thesis written, just before the onset 
of World War I, under the direction of Edmund Landau. After completing 
the work on his thesis, but before writing it up, Kornblum enlisted in the 
army. He died in the fighting on the Eastern Front. After the war, Landau 
completed the sad duty of writing up and publishing his student’s results, 
see Kornblum [1]. 

The proof of the theorem uses the theory of Dirichlet series. After giving 
the definitions and proving the elementary properties of these series, we 
outline the connection with primes in arithmetic progressions and isolate 
the main difficulty which is the proof that D(1,y) # 0 for non-trivial char- 
acters x. We then give a proof of this fact which differs from the Kornblum- 
Landau approach. It is an adaptation to polynomial rings of a proof of the 
corresponding number-theoretic fact due to de la Vallee Poussin. Finally, 
to complete the chapter, we give a refinement of Dirichet’s theorem, which 
shows that given an arithmetic progression {a+mz | a,m € A, (a,m) = 1}, 
then, for all sufficiently large integers N, there is a prime P of degree N 
which lies in this arithmetic progression. 

Before beginning we discuss the notion of the Dirichlet density of a set 
of primes in A. This will give a quantitative measure of how big such a set 
is. Let f(s) and g(s) be two complex valued functions of a real variable s 
both defined on some open interval (1,0). We define f ~ g to mean that 
J —g remains bounded as s — 1 inside (1, d). 
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Proposition 4.1. We have 


lor ¢a(s) ~log (>) = sie 


where the sum is over all irreducible monic polynomials P. 


Proof. Since ¢4(s) = (1 — q!~*)7! we see that lim,51:(s — 1)¢4(s) = 
1/log(q). Thus, log ¢4(s) — log(s — 1)~1 is bounded as s — 1, which estab- 
lishes the first relation. As for the second relation we see, using the Euler 
product for ¢,4(s) 


log Ca(s =~ Di loat (1-|P|~*) = IPE “/k = 2 IPI + dS PIM /k. 


P,k>2 


Now, Yin>2|PIW"*/k < Digs |PI™® = [PI (1 — |PI-8)* < 2|PI-*. 
Thus the last sum in the above equation is bounded by 2¢,4(2). This shows 
that log Ca(s) ¥ })p|P|~* which completes the proof. 


Definition. Henceforth the word “prime” will denote a monic irreducible 
in A. Let S be a set of primes in A. The Dirichlet density of S, 6(S) is 
defined to be 


provided that the limit exists. The limit is assumed to be taken over the 
values of s lying in a real interval (1, ). 

Several remarks are in order. First note that 0 < 6(S) <1 and if S = 
S; US, then 6(S) = 6(S;) + 6(S2) provided S; and Sz both have densities 
and are disjoint. Thus, Dirichlet density is something like a probability 
measure. One must not carry this too far, however. Dirichlet density is not 
countably additive. 

It is obvious that the Dirichlet density of a finite set is zero. Thus, if the 
Dirichlet density of a set exists and is positive, we are assured that the set 
is infinite. One of the two main results of this chapter asserts that if a and 
m are relatively prime polynomials, then the Dirichlet density of the set 
S={P¢€A|P prime, P = a (mod m)} exists and is equal to 1/@(m). 
It is in this refined form that we prove Dirichlet’s famous theorem in the 
context of the polynomial ring A. 

The next step is to introduce the main tools necessary to the proof, 
Dirichlet characters and Dirichlet L-series. 

Let m be an element of A of positive degree. A Dirichlet character modulo 
m is a function from A — C such that 


(a) x(a + bm) = x(a) for all a,b € A. 
(b) x(a)x(b) = x(ab) for alla,be A 
(c) x(a) £ 0 if and only if (a,m) = 1. 
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A Dirichlet character modulo m induces a homomorphism from 
(A/mA)* — C* and conversely, given such a homomorphism there is a 
uniquely corresponding Dirichlet character. The trivial Dirichlet character 
Xo is defined by the property that x.(a) = 1 if (a,m) =1 and x.(a) = 0 if 
(a,m) #1. 

It can be shown that there are exactly ®(m) Dirichlet characters modulo 
m which is the same cardinality as that of the group (A/mA)*. Let Xm be 
the set of Dirichlet characters modulo m. If v,~ € X,, define their product, 
xp, by the formula yv(a) = y(a)¥(a). This makes X,, into a group. The 
identity of this group is the trivial character y,. The inverse of a character 
is given by y~!(a) = x(a)~? if (a,m) = 1, and x~*(a) = 0 if (a,m) 4 1. 
It can be shown, but we will not do so here, that Xm is isomorphic to 
(A/mA)*, which is a much better result than the bare statement that they 
have the same number of elements. This is a special case of a general result 
which asserts that a finite abelian group G is isomorphic to its character 
group G, see Lang [4], Chapter 1, Section 9. 

Another definition is useful. If y € X,, let x be defined by x¥(a) = x(a) 
= complex conjugate of x(a). Since the value of a character is either zero 
or a root of unity, it is easy to see that ¥ = y~!. Moreover, we have the 
following very important proposition, the orthogonality relations. 


Proposition 4.2. Let y and w be two Dirichlet characters modulo m and 
a and b two elements of A relatively prime to m. Then 


(1) 7, x(a)v(a) = &(m)65(x, P). 


(2) D2, x(a)x(b) = &(m)6(a, b). 


The first sum is over any set of representative for A/mA and the second 
sum is over all Dirichlet characters modulo m. By definition, 6(v, wv) = 0 
wxAw andl ify=w. Similarly, 6(a,b) =0 ifaA~b and 1ifa=b. 


The proofs of all these facts are standard. For the corresponding facts 
over the integers, Z, the reader can consult, for example, Ireland-Rosen [1], 
Chapter 16, Section 3. The relations given in the above proposition are 
called the orthogonality relations. 


Definition. Let y be a Dirichlet character modulo m. The Dirichlet L- 
series corresponding to x is defined by 


From the definition and by comparison with the zeta function ¢,4(s) 
one sees immediately that the series for L(s,) converges absolutely for 
¥(s) > 1. Also, since characters are multiplicative we can deduce that the 
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following product decomposition is valid in the same region. 
x(P)\~1 
L(s,x) = [(2 - ) ; 


An immediate consequence of this product decomposition is the fact 
that the L-series corresponding to the trivial character is almost the same 
as Ca(s). More precisely, 


L(8,xo) = [] (t= pz) eats) 


P\|m 


This shows that L(s,y,) can be analytically continued to all of C and 
has a simple pole at s = 1 since the same is true of ¢,4(s). On the other 


hand, 


Proposition 4.3. Let y be a non-trivial Dirichlet character modulo m. 
Then, L(s,x) is a polynomial in q~* of degree at most deg(m) — 1. 


Proof. Define 
A(n,x)= > xf): 
deg(f) =n 


f monic 


It is clear from the definition of L(s, x) that 


L(s,x) = S> A(n,x)q°". 
n=0 


Thus, the result will follow if we can show that A(n,x) = 0 for all n > 
deg(m). 

Let’s assume that n > deg(m). If deg(f) = n, we can write f=hm-+r, 
where r is a polynomial of degree less than deg(m) or r = 0. Here, h is a 
polynomial of degree n—deg(m) > 0, whose leading coefficient is sgn(m)~! 
(since f is monic). Conversely, all monic polynomials of degree n > deg(m) 
can be uniquely written in this fashion. Since y is periodic modulo m and 
since h can be chosen in g”~?°8(™) ways, we have 


A(n, x) = q?S8 Sy (r) = 0, 


T 


by the first orthogonality relation (Proposition 4.2, part (1) ) since y # 
Xo, and the sum is over all r with deg(r) < deg(m), which is a set of 
representatives for A/mA. 


Proposition 4.3 shows that if y is non-trivial, then L(s,y) which was 
initially defined for #(s) > 1 can be analytically continued to an entire 
function on all of C. We have already seen that L(s,y,) can be analytically 
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continued to all of C with a simple pole at s = 1. These facts are much 
harder to establish when working over Z rather than A. 

In the proof of Dirichlet’s theorem on primes in arithmetic progressions 
the most difficult part is the proof that L(1, x) 4 0 if x is non-trivial. This 
turns out to be substantially easier in function fields because the L-series 
are essentially polynomials. We begin with a lemma. 


Lemma 4.4. Let y vary over all Dirichlet characters modulo m. Then, for 
each prime P not dividing m, there exist positive integers fp and gp such 
that fpgp = ®(m) and 


[[ 4s.x) = T]@- ler’). 


Ptm 


Proof. For a fixed prime P not dividing m, the map y > x(P) is a 
homomorphism from the group X,, — C*. The image must be a cyclic 
group of order fp, say, generated by ¢;,. If gp is the order of the kernel, 


clearly fpgp = ®(m). 
With these preliminaries, we calculate for fixed P. 


fe-1 
[[@-x(P)PI-*) = II (1—¢5,|P|-*)9” = (1 —|P|-f?*)9”. 


Now take the inverse of both sides, multiply over all P, and the lemma 
follows. 


Lemma 4.5. Suppose x is a complex Dirichlet character modulo m , i.e. 
X#X. Then, L(1,x) £0. 


Proof. The right-hand side of the equation in the statement of Lemma 4.4 
is equal to a Dirichlet series with positive coefficients and constant term 
1. Consequently, its value at real numbers s such that s > 1 is a real 
number greater than 1. Suppose xy is a complex Dirichlet character and 
that L(1,x) =0. Then, by complex conjugation we see L(1,X) = 0 as well. 
In the product IL, D(s, x) the term corresponding to the trivial character 
has a simple pole at s = 1. All the other terms are regular there and two 
of them have zeros. Thus, the product is zero at s = 1. This contradicts 
the fact, established above, that for all s > 1 the value of the product is 
greater than 1. Thus, L(1, x) 4 0, as asserted. 


The next step is to deal with real-valued characters. It is not hard to see 
that these coincide with characters of order 2. The proof for such characters 
will be a modification of a proof of the classical case due to de la Vallée 
Poussin. 

Assume now that x has order 2 and consider the function 


L(s, x0) L(8,x)_ 


G(s) = L(2s, xo) 
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This can be written as a product over all monic irreducibles not dividing 
m. Let P be such a prime. Then x(P) = +1. The factor of the above series 
corresponding to P is 

(1 = |P|7*)* = x(P)PL 8) 
(1 — |P|-*8)~? 


If y(P) = —1 this whole factor reduces to 1. If x(P) = 1 it simplifies to 


(1+ |P|-*) — —ks 
~________— = 1492 P 
(— |PP*) d'P 


It follows from these remarks that G(s) is a Dirichlet series with non- 
negative coefficients. This will shortly play a crucial role. 

First, we look more carefully at L(s, xo0)/L(2s, xo). As we have already 
seen, 


L(s, x0) = [[ 1 -IPI-*)¢a(s) = T]-1P) a 


1—s 
P|m P|m q 


A short calculation shows 


L(s, x0) _ uke 4 \P|-8)72 1 —q' 


L(Qs, Xo) Pim 1- qi-s 
From this identity and what we have already proven about G(s) we deduce 
that ane 
1 __ _ S) 
A _ => =~ a 
a Dirichlet series with non-negative coefficients. 


It is now convenient to switch to a new variable, u = q~*. The above 
equation becomes 


where L*(u, vy) is a polynomial in u by Proposition 4.3, and 


A(d)= > a(n) 
n,deg(n)=d 


is non-negative for all d > 0 and A(0) = 1. The Dirichlet series converges 
for Re(s) > 1 which implies the power series in u converges for |u| < q7!. 
Finally, notice that s = 1 corresponds to q~! so what we are trying to prove 
is that L*(q~', y) 4 0. We now have developed everything we need to give 
a quick proof of this. 
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We argue by contradiction. Suppose L*(q~!, x) = 0. Then (1—qu) divides 
D*(u, x) and the left-hand side of the above equation is a polynomial in wu. 
It follows that the right-hand side is a polynomial in u with non-negative 
coefficients and constant term 1. It therefore cannot have a positive root. 
However, the left-hand side vanishes when u = 1/,/q. This is a contradic- 
tion, so L*(q~', x) #0 and thus, L(1,) 4 0. We have proven the following 
key result. 


Proposition 4.6. Let y be a non-trivial Dirichlet character modulo m. 


Then, L(1,x) #0. 


From Proposition 4.6 and previous remarks we see that as s > 1 with s 
real and greater than 1 we have 


lim log L(s,7x.) =co and lim log L(s,x) exists, for x # xo. 
sl sl 


Here, and in what follows we take for log(z) the principal branch of the 
logarithm. 


Theorem 4.7. Let a,m € A be two relatively prime polynomials with m of 
positive degree. Consider the set of primes, S = {P€ A|P=a(modm)}. 
Then, 6(S) =1/®(m). In particular, S is an infinite set. 


Proof. Using the product formula for L(s,y) and the same technique used 
in the proof of Proposition 4.1, one finds 


P 
log L(s, x) = oe + R(s,x) , 
P 


where the function R(s, x) is bounded as s tends to 1 from above. Multipy 
both sides by ¥(a) and sum over all x. Using the orthogonality relation for 
Dirichlet characters, Proposition 4.2, part (2), we obtain 


S| X(a) log L(s, x) = &(m) IPP 
x P=a (mod m) 
where R(s) is a function which remains bounded as s = 1. 

Divide each summand on the left-hand side of the above equation by 
>/p |P|~* and let s tend to 1 from above. By Proposition 4.1 and the 
remarks preceding the theorem, the summand corresponding to the trivial 
character tends to 1, while each summand corresponding to a non-trivial 
character tends to zero. If we divide the right-hand side by }),|P|~* and 
let s tend to 1 from above, we get ®(m)d(S). The result follows. 


Theorem 4.7 is the original form of Dirichlet’s theorem. It is possible, 
with more work, to prove a much stronger form of the theorem. Suppose 
a,m € A are relatively prime and that m has positive degree. Consider the 
set of primes 


Sn(a,m)={Pe€A]|P=a(modm), deg(P) = N}. 
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We claim that for all large integers N this set is not empty. The following 
theorem proves this and more. 


Theorem 4.8. 


1 @N 2 
#SnN(a,m) = Sim) N +O (5) ; 


It will take us several steps to prove this result, but first, a remark. Let 
Sy be the set of primes of degree N. We have seen (Theorem 2.2) that 


This is a natural density analogue to the Dirichlet density form of the 
main theorem. 


Proof of Theorem 4.8. The idea of the proof is to realize that the L- 
series L(s, x) can be expressed as a product in two ways. One way, which we 
have already considered, is as an Euler product. The other is as a product 
over its complex zeros. This is made easier by rewriting, as we have done 
before, everything in terms of the variable u = q~°. If x is not trivial, then 
by Proposition 4.3, D(s, x) is a polynomial in g~* of degree at most M —1 
where M = deg(m). We have 


M-1 M-1 
D*(u,x) = S- ak = [| ( 1—a;(x)u) . (1) 
k=0 i=1 


The second expression for L*(s, x) comes from rewriting the Euler prod- 
uct for D(s, x) in terms of u. We first regroup the terms in the Euler prod- 
uct. 


L(s,x)=][ a-xPlerm)t= TE [TT a-x\c%*)? 


Ptm d=1 Ptm 
deg(P)=d 


Now, make the substitution wu = q~*. We obtain the expression 


Leuxy=]] Jf G@-x(P)u4)?. (2) 
Ptm 
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Our intention is to take the logarithmic derivative of both expressions, 
write the results as power series in u and compare coefficients. Afterwards 
we apply the orthogonality relations to isolate the primes congruent to a 
modulo m. However, in addition to the algebra involved, we will have to 
do a number of estimates. One of these estimates will involve invoking a 
deep result of A. Weil. The others are more elementary. 

We begin by writing down an identity which will be used repeatedly. 
Namely, 


CO 
ue (log(1 — au)7*) = So akul ; (3) 
k=1 
Here a is a complex number. The sum converges for all u such that |u| < 
la|~!. The proof of this identity is a simple exercise using the geometric 
series. 
For each character y modulo m define the numbers cy (x) by 


OO 


d > _ N 
u= log(L*(u, x) = 2 en(xu . 


We claim that 
cn(Xo) =q* +O(1) and that ey(x) = O(¢ 


The easy case is when x = yo. Recall that 


L(s,x0) = [] (1-IP|~*) ¢a(s) - 


P|m 


Thus, 


L*(u, Xo) _ [Ia _ yes P) 1 


P\lm 1 qu 


It now follows immediately, using Equation 3 and the additivity of the 
logarithmic derivative, that cv (x0) = ¢ +O(1). For x 4 xo, by combining 
Equation 1 with Equation 3 we find 


M-1 
ev (x) = — So a(x). 
k=1 


It follows from the analogue of the Riemann hypothesis for function fields 
over a finite field that each of the roots a,(x) has absolute value either 1 
or ,/q. This is the deepest part of the proof and is due to A. Weil (see Weil 
[1]). We will discuss it in some detail in the next chapter. In the Appendix 
to this book we will present an “elementary” proof, due to E. Bombieri, of 
this important result . Assuming it for now, we see immediately from the 
last equation that cy (x) = O(q%/2). Thus, we have verified both assertions 
of (4) above. 
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It should be remarked that one can prove much more easily, a weaker 
result than the Riemann hypothesis which has the effect of replacing the 
error term in the theorem with O(q°") where 6 is some real number less 
than 1. This still gives the corollary that the set Sj (a,m) is non-empty for 
all large N. We will indicate how to prove this in the next chapter. 

We now continue with the proof of the theorem. Consider the Euler 
product expansion of L*(s,x) given by Equation 2. Take the logarithmic 
derivative of both sides and multiply both sides of the resulting equation 
by u. Again using Equation 3 we find 


en(x)= >> degPx(P)*. 
k,P 
k deg P=N 


In the sum on the right-hand side separate out the terms corresponding 
to k = 1. The result is N 0 4., p-n X(P). The rest of the terms can be 


written as follows: 
Sod So xX(PyN/4. 


d|N deg P=d 
d<N/2 


The inner sum in absolute value is less than or equal to #{P € A| deg P = 
d} = q*/d + O(q?/2/d) by Theorem 2.2. Thus the double sum is bounded 
by 


N 
2 


l+qt¢ Pte tq 4 odtqt get: +q/4) = O(g?). 


We have proven 
). (5) 


Finally we compute the expression }7, ¥(a)en(x) in two ways. First we 
use Equation 5 and then we use Equation 4. 
From the orthogonality relations and Equation 5 we find 


1 _ N 
Bim) 2 X(aew(x) = N#Sy(a,m) +O(q?) . 
Next, from Equation 4 we see 


S> x(ajen(x) = 9% +O0(q7) 


So, we finally arrive at the main result: 
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Exercises 


1. 


. With the same notation as in Exercise 4, consider the coefficient of q7~ 


Let S = {P,, Po,...} be the set of monic primes in A. Let S; = {P;} 
be the set consisting of P; alone. Then, S = U?2,5;. Show that this 
implies that Dirichlet density is not countably additive. 


. Let P(T) € A and define N(U? = P(T)) to be the number of pairs 


(a, 3) € F x F such that 6? = P(a). Show that 


N(U? = P(T)) = S\(1+ P(a)**). 


ack 


. Suppose q is odd and let P € A is a monic irreducible of degree two 


and that x(a) = (a/P)2 for all a € A. Show that Da(s,v) =1tq-°. 
(Hint: Use the Reciprocity Law and Exercise 2). 


. In general, suppose P € A is a monic irreducible of positive degree 


and set v(a) = (a/P) 9. Show that 


>, x(a) =+(-N(U? = PT)). 


a monic 
deg a=1 


8s 


in L(s,x). Use Exercise 4 and the Riemann Hypothesis for function 
fields to prove 


IN(U? = P(T)) - q| < (deg P - 1) V4 - 


. Let h(T) € A be a polynomial of degree m with a non-zero constant 


term. Show that there are infinitely many primes in A whose first 
m+ 1 terms coincide with h(T). What is the Dirichlet density of this 
collection of primes? 


. Let {a1,a2,...,a } be the elements of F labeled in some order and 


choose elements 6; € F* fori =1,...,q , where repetition is allowed. 
Prove that thee are infinitely many primes, P(T), such that P(a;) = 
(8; fori =1,...,q. What is the Dirichlet density of this set of primes? 


O 


Algebraic Function Fields 
and Global Function Fields 


So far we have been working with the polynomial ring A inside the ratio- 
nal function field k = F(T). In this section we extend our considerations 
to more general function fields of transcendence degree one over a general 
constant field. This process is somewhat like passing from elementary num- 
ber theory to algebraic number theory. The Riemann-Roch theorem is the 
fundamental result needed to accomplish this generalization. We will give 
a proof of this fundamental result in Chapter 6. In this chapter we give 
the basic definitions, state the theorem, and derive a number of important 
corollaries. After this is accomplished, attention will be shifted to function 
fields over a finite constant field. Such fields are called global function fields. 
The other class of global fields are algebraic number fields. All global fields 
share a great number of common features. We introduce the zeta function 
of a global function field and explore its properties. The Riemann hypoth- 
esis for such zeta functions will be explained in some detail, and we will 
derive several very important consequences, among others an analogue for 
the prime number theorem for arbitrary global function fields. A proof of 
the Riemann hypothesis will be given in the appendix. In this chapter we 
will prove a weak version. This is enough to yield the analogue of the prime 
number theorem , albeit with a poor error term. In later chapters we will 
also explore L-functions associated to global function fields - both Hecke L- 
functions (generalizations of Dirichlet L-functions) and Artin L-functions. 

One final comment before we begin. Our treatment of this subject is 
very arithmetic. ‘The geometric underpinnings will not be much in evidence. 
The whole subject can be dealt with under the aspect of curves over finite 
fields. We have chosen the arithmetic approach because our guiding theme 
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in this book will be the exploration of the rich analogies that exist between 
algebraic number fields and global function fields. 

To begin with it is not necessary to restrict the constant field F’ to be 
finite. In fact, in this first part of the chapter we make no restrictions on F’ 
whatsoever. A function field in one variable over F is a field K, containing 
F and at least one element x, transcendental over F’, such that K/F (2) is a 
finite algebraic extension. Such a field is said to have transcendence degree 
one over F’. It is not hard to show that the algebraic closure of F' in K is 
finite over F’. One way to see this is to note that if EF is a subfield of kK, 
which is algebraic over F’, then [FE : F)] = (E(x): F(x)| < [K : F(z)]. So, 
replacing F’ with its algebraic closure in K, if necessary, we assume that 
F is algebraically closed in K. In that case, F’ is called the constant field 
of K. Note the following simple consequence of this definition. If F’ is the 
constant field of K and y € K is not in F,, then y is transcendental over 
F’, It is also true that K/F'(y) is a finite extension. To see this, note that 
y is algebraic over F(x) which shows there is a non-zero polynomial in two 
variables g( X,Y) € F|X,Y] such that g(x,y) = 0. Since y is transcendental 
over F we must have that g(X,Y) ¢ F[Y]. It follows that x is algebraic 
over F'(y). Since K is finite over F(x, y) and F(z, y) is finite over F'(y), it 
follows that K is finite over F'(y). 

A prime in K is, by definition, a discrete valuation ring R with maximal 
ideal P such that F C R and the quotient field of R equal to K. As a 
shorthand such a prime is often referred to as P, the maximal ideal of R. 
The ord function associated with R is denoted ordp(*). The degree of P, 
deg P, is defined to be the dimension of R/P over F' which can be shown to 
be finite. We sketch the proof. Choose an element y € P which is not in F’. 
By the deductions of the last paragraph, K/F(y) is finite . We claim that 
[R/P:F] < |K : F(y)]. To see this let u,,u2,...,Um € R be such that the 
residue classes modulo P, t1,tg,...,Um, are linearly independent over F’. 
We claim that uy, ug,...,Um are linearly independent over F'(y). Suppose 
not. Then we could find polynomials in y, {fi(y), fo(y),..-,fm(y)}, such 
that 


fi(y)ui + fo(yjue+:::+fm(y)um =0. 


It is no loss of generality to assume that not all the polynomials f;(y) are di- 
visible by y. Now, reducing this relation modulo P gives a non-trivial linear 
relation for the elements w; over F’, a contradiction. Thus, {u1,u2,...,Um} 
is a set linearly independent over F'(y) and it follows that m < [K : F(y)| 
which proves the assertion. 

To illustrate these definitions, consider the case of the rational function 
field F(x). Let A = F(z]. Every non-zero prime ideal in A is generated 
by a unique monic irreducible P. The localization of A at P, Ap, is a 
discrete valuation ring. We continue to use the letter P to denote the unique 
maximal ideal of Ap. It is clear that P is a prime of F(x) in the above sense. 
This collection of primes can be shown to almost exhaust the set of primes 
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of F(x). In fact, there is just one more. Consider the ring A’ = F[r71] 
and the prime ideal P’ generated by x~! in A’. The localization of A’ at 
P’ is a discrete valuation ring which defines a prime of F(x) called the 
prime at infinity. This is usually denoted by P,. or, more simply, by “oo” 
alone. The corresponding ord-function, ord.., attaches the value — deg(/) 
to any polynomial f € A and thus the value deg(g) — deg(f) to any rational 
function f/g where f,g € A. The reader may wish to supply the proof that 
the only primes of F(x) are the ones attached to the monic irreducibles, 
called the finite primes, together with the prime at infinity. The degree of 
any finite prime is equal to the degree of the monic irreducible to which it 
corresponds, and the degree of the prime at infinity is 1. 

Returning to the general case, the group of divisors of K, Dx, is by 
definition the free abelian group generated by the primes. We write these 
additively so that a typical divisor looks like D = }>,a(P)P. The coeffi- 
cients, a(P), are uniquely determined by D and we will sometimes denote 
them as ordp(D). The degree of such a divisor is defined as deg(D) = 
>»_p a(P) deg P. This gives a homomorphism from Dx to Z whose kernel 
is denoted by D%, the group of divisors of degree zero. 

Let a € K*. The divisor of a, (a), is defined to be 5), ordp(a)P. It is 
not hard to see that (a) is actually a divisor, i.e., that ordp(a) is zero for 
all but finitely many P. The idea of the proof will be included in the proof 
of Proposition 5.1 (given below). The map a —> (a) is a homomorphism 
from K* to Dx. The image of this map is denoted by Px and is called the 
group of principal divisors. 

If P is a prime such that ordp(a) = m > 0, we say that P is a zero of a 


of order m. If ordp(a) = —n < 0 we say that P is a pole of a of order n. 
Let 
(a)o = S> ordp(a) P and (ajo =-— S- ordp(a) P. 
P P 
ordp(a)>0 ord p(a)<0 


The divisor (a), is called the divisor of zeros of a and the divisor (a), is 
called the divisor of poles of a. Note that (a) = (a), — (@)oo. 


Proposition 5.1. Leta € K*. Then, ordp(a) = 0 for all but finitely many 
primes P. Secondly, (a) = 0, the zero divisor, if and only ifa € F*, t.e., a 
1s a non-zero constant. Finally, deg(a), = deg(a). = [K : F(a)]. It follows 
that deg(a) = 0, t.e., the degree of a principal divisor is zero. 


Proof. (Sketch) Ifa € F™, it is easy to see from the definitions that (a) = 0. 
So, suppose a € K* — F*. Then, as we have seen, K is finite over F(a). Let 
R be the integral closure of Fa] in K. Ris a Dedekind domain (see Samuel 
and Zariski [1], Chapter V, Theorem 19). Let Ra = Pi B5? --- 85° be the 
prime decomposition of the principal ideal Ra in R. The localizations of R 
at the prime ideals 3; are primes of the field K. If we denote by P, the 
maximal ideals of these discrete valuation rings we find that ordp, (a) = e. 
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It is now not hard to show that the finite set {P,, Po,...,P,} is the set 
of zeros of a. Applying the same reasoning to a~+ we see that the set of 
poles of a are is also finite. This proves the first assertion. It also proves 
the second assertion since if a is not in F'* we see that the set of P such 
that ordp(a) > 0 is not empty. 

To show [K : F'(a)| = deg(a), = deg(a).. we can apply Theorem 7.6 of 
this book if we assume that F is a perfect field. For the general case, see 
Deuring {1}, Chevalley [1], or Stichtenoth [1]. 


For emphasis we point out that implicit in the above sketch is the fact 
that every non-constant element of K has at least one zero and at least one 
pole. 

Two divisors, D,; and Do, are said to be linearly equivalent, D, ~ Do if 
their difference is principal, i.e., D, — Do = (a) for some a € K™. Define 
Clk = Dx/Px, the group of divisor classes. Since the degree of a principal 
divisor is zero, the degree function gives rise to a homomorphism from Clr 
to Z. The kernel of this map is denoted Cl, the group of divisor classes 
of degree zero. 

We are almost ready to state the Riemann-Roch theorem. Just two more 
definitions are needed. A divisor, D = $),a(P)P, is said to be an effective 
divisor if for all P, a(P) > 0. We denote this by D > 0. 


Definition. Let D be a divisor. Define L(D) = {x € K*|(x)+D > 
0 }U{O}. It is easy to see that L(D) has the structure of a vector space over 
F’ and it can be proved that it is finite dimensional over F’ (see Exercises 
17 and 18). The dimension of L(D) over F' is denoted by [(D). The number 
[(D) is sometimes referred to as the dimension of D. 


Lemma 5.2. If A and B are linearly equivalent divisors, then L(A) and 
L(B) are isomorphic. In particular, |(A) = l(B). 


Proof. Suppose A = B+ (h). Then a short calculation shows that x > xh 
is an isomorphism from L(A) with L(B). 


Lemma 5.3. If deg(A) < 0 then I(A) = 0 unless A ~ 0 in which case 
I(A) =1. 


Proof. If deg(A) < 0 and x € L(A), then deg((z) + A) is both < 0 and 
> 0 which is a contradiction. If deg(A) = 0 and L(A) is not empty, let 
x € L(A). Then (x) + A > 0 and has degree zero, so it must be the zero 
divisor. Thus, A ~ 0. Conversely, if A ~ 0, then /(A) = 1(0) = 1 since 
L(0) = F because x € L(0) implies x has no poles and so z € F. 


Before stating the Riemann-Roch theorem it is worth pointing out that 
Lemma 5.2 shows /(A) depends only on the class of A. Similarly, deg(A) 
depends only on the class of A. Thus we could define /(A) and deg(A) and 
state Riemann-Roch in terms of divisor classes. However, we prefer to state 
it in terms of divisors which is more customary. 
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Theorem 5.4. (Riemann-Roch) There is an integer g > 0 and a divisor 
class C such that for C €C and A € Dx we have 


I(A) = deg(A) — g +1 +1(C — A). 


The proof will be given in the next chapter. For other treatments see 
Chevalley {1], Deuring [1], Eichler [1], Moreno [1], or Stichtenoth [1]. The 
integer g is uniquely determined by K, as we shall see, and is called the 
genus of K. The genus of a function field is a key invariant. The divisor 
class C is also uniquely determined and is called the canonical class. It is 
related to differentials of K. In the next chapter we will define the notion 
of a Weil differential. To each Weil differential will be associated a divisor. 
It turns out that all such divisors are equivalent and that C, the canonical 
class, is the equivalence class of divisors of Weil differentials. 

We now give a series of corollaries to this important theorem. 


Corollary 1. (Riemann’s inequality) For all divisors A, we have I(A) > 
deg(A) —g+1. 


Corollary 2. For C €C we have l(C) = g. 

Proof. Set A = 0 in the theorem. 

Corollary 3. ForC €C we have deg(C’) = 2g — 2. 
Proof. Set A = C in the theorem, and use Corollary 2. 


Corollary 4. If deg(A) > 29 —2, then I(A) = deg(A) —g+1 except in the 
case deg(A) = 2g —2 and AEC. 


Proof. If deg(A) > 2g — 2, then deg(C — A) < 0. Now use Lemma 5.3. 


Corollary 5. Suppose that g’ and C” have the same properties as those of 
g and C stated in the theorem. Then, g=g' andC ~C’, 


Proof. Find a divisor A whose degree is larger than max(2g — 2, 2g’ — 2) (a 
large positive multiple of a prime will do). By Corollary 4, [(A) = deg(A) — 
g+1 = deg(A)—g’+1. Thus, g = g’. Now set A = C” in the statement of the 
theorem. Using Corollaries 2 and 3, applied to C’, we see that 1(C—C”) = 1. 
There is an x € K* such that (x) + C —C’ > 0. On the other hand, 
(x) +C —C’ has degree zero by Corollary 3. Thus, it is the zero divisor, 
and C~ C", 


As an example of these results, consider the rational function field F(z). 
Let (Roo, Ps.) be the prime which is, as we have seen, the localization of 
the ring F'[1/z] at the prime ideal generated by 1/2. The corresponding ord 
function is ord..(f) = —deg(f). By Corollary 4, for n large and positive 
we must have /(nP..) =n—g+1. On the other hand, one can prove that 
f € L(nP,) if and only if f is a polynomial in T' of degree < n. Thus, 
l(nP.o) =n +1. It follows that g = 0. From this and Corollary 3 one sees 
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that C has degree —2. It can be shown that Cl% = (1) so there is only one 
class of degree —2 and we can choose any divisor of degree —2 for C. A 
conventional choice is C = —2P.,,. 

We can characterize the rational function field intrinsically as follows: 
K/F is a rational function field if and only if there exists a prime P of kK 
degree 1 and the genus of K is 0. We have seen that rational function fields 
have this property. Now, assume these conditions and consider /[(P). Since 
g = 0 we have [(D) = deg D—g+1=degD+1 for degD > 2g —2 = 
—2. Thus, /(P) = 2 and we can find a non-constant function x such that 
(x) + P > 0. Since deg ( (x) + P) = 1, it follows that (x) + P = Q, a prime 
of degree 1. Thus, (x) = Q — P and it follows that [K : F(x)] = 1. Thus, 
K = F(z) as asserted. 

In the same way one can investigate fields of genus 1. Assume K is a 
function field of genus 1 and that there is a prime P of degree 1. Such a 
field is called an elliptic function field. By the above results we have , for 
any divisor D, 1(D) = deg(D) if deg(D) > 2g —-2 = 0. Thus, I(nP) = 
n for positive integers n. Taking n = 2 and n = 3 we see there exist 
functions x and y with polar divisors 2P and 3P, respectively. It follows 
that [K : F(x)] = 2 and [K : F(y)| = 3 so that K = F(a,y). We see that 
y must satisfy a quadratic equation over F'(x). One can prove much more. 
If the characteristic of F is different from 2 one can show that by a small 
change of variables y can be chosen so that y? = f(x) where f(x) is a cubic 
polynomial of degree 3 without repeated roots. See Silverman [3] for more 
details. 

For the rest of this section we assume that F = F is a finite field with 
q elements. A function field in one variable over a finite constant field is 
called a global function field. Our next goal is to define the zeta function 
of a global function field K/F and to investigate its properties. 

It was proven by F.K. Schmidt, Schmidt [1], that a function field over a 
finite field always has divisors of degree 1. We will assume this, although 
it is possible to give a proof without introducing any new concepts. Using 
Schmidt’s theorem, we have an exact sequence 


(0) > Clk > Clk — Z = (0). 


We will prove shortly that the group Cl% is finite. Denote its order by 
hx. The number hx is called the class number of the field K. This number 
is an important invariant of K and has been the object of much study. The 
above exact sequence shows that for any integer n there are exactly hx 
classes of degree n. 


Lemma 5.5. For any integer n > 0 the number of effective divisors of 
degree n is finite. 


Proof. (Sketch) Choose an x € K such that x is transcendental over F. 
Kk /F(z) is finite. The primes of F(z) are in one to one correspondence with 
the monic irreducible polynomials in F(z] with the one exception of the 
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prime at infinity. Thus, there are only finitely many primes of F(x) of any 
fixed degree. By standard theorems on extensions of primes (see Chapter 
7) one sees that there are only finitely many primes of K of fixed degree. If 
> p a(P)P is an effective divisor of degree n then each prime that occurs 
with positive coefficient must have degree < n. There are only finitely many 
such primes. Moreover the coefficients must be < n, so there are at most 
finitely many such effective divisors. 


We define a, to be the number of primes of degree n and 6, to be the num- 
ber of effective divisors of degree n. Both these numbers are of considerable 
interest. 


Lemma 5.6. The number of divisor classes of degree zero, hx, is finite. 


Proof. Let D be a divisor of degree 1. If A is any divisor of degree 0, then 
deg(gD+ A) = g and so by Riemann’s inequality, [(gD+A) >g-g+1=1. 
Let f € LigD + A). Then, B = (f) + 9D+A2>0 and so A~ B—gD 
where B is an effective divisor of degree g. It follows that the number of 
divisor classes of degree zero is bounded above by the number of effective 
divisors of degree g which is finite by Lemma 5.5. More precisely, what we 
have shown is that hy < by. 


We have now proved that the class number hx = |Cl¥,| is finite. Later we 
will give estimates for the size of hx derived from the Riemann hypothesis 
for function fields (see Proposition 5.11). 


Lemma 5.7. For any divisor A, the number of effective divisors in A is 


gi (A) _4 
q-1 ° 


Proof. We begin by showing that A contains effective divisors if and only 
if 1(A) > 0. 

Suppose B € A and is effective. There is an f € K* such that (f)+A= 
B>0,so0 f € L(A) and I(A) > 0. The converse is obtained by just running 
this proof backwards. 

Suppose /(A) > 0. The map from L(A) — {0} to effective divisors in A 
given by f > (f)+A is onto. Two functions f and f’ have the same image 
iff (f) +A = (f’) +A iff (f) = (f') iff (f’f7') = 0. The last condition 
happens iff f’f—1 is in F* by Proposition 5.1. Since L(A) — {0} has q'(4) —1 
elements and the fibers of our map have q — 1 elements, the result follows. 

Finally, if 1(A) = 0, then q'(4) — 1 = 0 and the result holds in this case 
as well. 


For A € Dx define the norm of A, NA, to be q?°#(4), Note that NA is a 
positive integer and that for any two divisors A and B we have N(A+B) = 
NANB. 


Definition. The zeta function of K, ¢x(s), is defined by 
(x(s)= SNA. 


A>0 
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Over the rational function field k = F(T) we did not discuss the zeta 
function of k but rather the zeta function associated to the ring A = F(T]. 
These are closely related. In fact, it is not hard to prove that ¢,4(s) = 
Ce(s)(1—q78), so Ce (s) = (1—q'~8)7!(1—q7$)7?. Also in the general case 
it is sometimes useful to associate zeta functions with appropriate subrings 
of the field. However, for the purposes of the following discussion we will 
concentrate on the zeta function of the field 

The term NA~® in the definition of the zeta function is equal to q7 
where n is the degree of A. Thus the zeta function can be rewritten in the 


form 
oe) bn 
CK (8) — » ns ° 
n=1 q 


Another key fact is that we have an Euler product for ¢x(s). Using the 
multiplicativity of the norm and the fact that Dx is a free abelian group 
on the set of primes we see, at least formally, that 


¢x(s) =|] € - xm) 3 


Recalling that a, is the number of primes of degree n, we observe that 
this expression can be rewritten as follows: 


éx(s) = T] (0 - =) | 


n=l 


TUS 


We shall soon see that all these expressions converge absolutely for 
R(s) > 1 and define analytic functions in this region. 


Lemma 5.8. Leth =hx. For every integer n, there are h divisor classes 
of degree n. Suppose n > 0 and that {A,, Ao,..., An} are the divisor classes 
of degree n. Then the number of effective divisors of degree n, by, is given 


h gi(Av—1 


Proof. The first assertion follows directly from Lemma 5.6 and the remarks 
preceding Lemma 5.5. The second follows just as directly from Lemmas 5.6 
and 5.7. 


By Lemma 5.7 and Corollary 4 to Theorem 5.4, we see that ifn > 2g—2, 


then b, = hy2——=+. It follows that b, = O(q”). From this fact, and the 
expression €x(s) = )>-_) bng~™, it follows that ¢x(s) converges absolutely 
for all s with R(s) > 1. 

In the same way we can prove the product expression for ¢x% (s) converges 
absolutely for R(s) > 1. To do this it suffices, by the theory of infinite 
products, to show that 5>°°_, an|q7”*| converges in this region. This follows 


immediately since an < b, = O(q"). 
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The next thing to do is to investigate whether ¢x(s) can be analytically 
continued to all of C and whether it satisfies a functional equation, etc. 
The next theorem shows that the answer to both these questions is yes, 
and that a lot more is true as well. 


Theorem 5.9. Let K be a global function field in one variable with a finite 
constant field F with q elements. Suppose that the genus of K is g. Then 
there is a polynomial Lx(u) € Zu] of degree 2g such that 


Le(q*) 
(1 —q78)(1—q!-5) 


This holds for all s such that #(s) > 1 and the right-hand side provides an 
analytic continuation of Cx(s) to all of C. ¢y(s) has simple poles at s = 0 
ands = 1. One has Lx (0) = 1, Li, (0) = ay —1—4q, and Lx(1) = hex. 
Finally, set Ex(s) = q9~8¢x(s). Then for all s one has Ex(1—s) = €x(s) 
(this relationship is referred to as the functional equation for Cx (s)). 


CK(s) = 


Proof. It is convenient to work with the variable u = q~*. Then 


6x(s) 2! Ze(u) = S~ bau” 


n=0 
: nm—gtl_ 
We noted earlier that for n > 2g — 2 we have b, = hx qt. Sub- 
stituting this into the above formula and summing the resulting geometric 
series, yields 


From Equation 2, we see the expression for ¢;,(s) given in the theorem is 
correct. We will show that Dx(1) and Lx(q7') are both non-zero. Thus, 
Ck(s) has a pole at 0 and 1. The fact that deg Lx(u) < 29 also follows 
from this calculation. Substituting u = 0 yields Lx (0) = 1. Comparing the 
coefficients of u on both sides yields b; = Li, (0) +1+4q. It is easy to see 
that 6; = a; = the number of primes of K of degree one. 

From Equation 1 above, we see that limy41(u—1)ZxK(u) = hx/(q—1). 
From Equation 2 we see 


lim (u—1)Z¢(u) = LE | 


ul l—q 


Thus, Lx (1) = hx, as asserted. 
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As for the functional equation, recall that bp = )iaep A=n (qi(A) — 1)/ 
(q — 1). Then, 


_ F i i 1 
(q-1)Zx(u) = S- S- gA) —i)ynr- \- g'A) dee A “hk; — 
n=0 deg A=n deg A>0 
_ _ l - _ 
_ (A), degA __ 1(A), deg A 
7 Dd oo MG —U 3s _ vo 
O<deg A<2g—2 2g—2<deg A<oco 
_ _ 2g—1 
= SD gAyted he—— thet — 
1—u 1 — qu 


O<deg A<2g—2 
Multiplying both sides by u!~9 we have (q—1)u'~9Zx(u) = R(u)+S(u) 
where 


1—g qi ud 


R(u) _— S- gAysee A -gt1 and S'(u) = —hk — 7 i, qu 
O<deg A<2g—2 


1 1 


A direct calculation shows that S(u) is invariant under u — q7‘u~. 
R(w) is also invariant under this transformation. To see this, first note that 


R(q7*u7!) = ‘> gi(A)+9-1-deg A, deg A+g—1 


deg A<2g—2 
From the Riemann-Roch Theorem, Theorem 5.4, and Corollary 3, we see 
l(c — A) = deg(C — A) -—g+1+1(A)=g-—1-—degA+I(A) . 
Substituting this expression into the formula for R(q~'u7') yields 


R(q7tu7!) _ S- gi(C- A) ysea(C—A)—9 +1 
deg A<2g-—2 


Since A —> C — A is a permutation of the divisor classes of degree d with 
0<d< 2g—2 it follows that R(q~'u—') = R(u) as asserted. We have now 
completed the proof that u!~9Zx(u) is invariant under the transformation 

Since u'~9Z(u) is invariant under u > q~tu™?, it follows easily that 
q 9u~*9 LK (u) = Le (q7'u-"*). Letting u > oo we see that deg Lx (u) = 2g 
and that the highest degree term is q9u79. 

Finally, recalling that u = q~*, we see that u!-9 = q'9-)* and the 
transformation u — q~!u7! is the same as the transformation s > 1 — s. 
So passing from the u language to the s language we see we have shown 
€x(s) is invariant under s > 1 — s, as asserted. 
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The polynomial Lx%(u) defined in the theorem carries a lot of informa- 
tion. Since the coefficients are in Z we can factor this polynomial over the 
complex numbers, 

29 
Lx(u) = |] - mu) 


t=1 


It is worth pointing out that the relation Lx (q~'u7') = q79u~*9 Lx (u) 


implies that the set {71,72,...,72g} is permuted by the transformation 
mt — q/t. This is easily seen to be equivalent to the functional equation for 
Cx(s). 


Since ¢x(s) has a convergent Euler product whose factors have no zeros in 
the region Re(s) > 1, it follows that ¢x(s) has no zeros there. Consequently, 
Lx(u) has no zeros in the region {u € C | Ju] < q7*}. For the inverse roots, 
™;, the consequence is that |7,;| < gq. We will prove later, Proposition 5.13, 
that |7;| < q for all and this will have a number of important applications. 
However, much more is true. The classical generalized Riemann hypothesis 
states that the zeros of ¢x(s), the Dedekind zeta function of a number field 
A, has all its non-trivial zeros on the line R(s) = 1/2. Riemann conjectured 
this for ¢(s), the Riemann zeta function. Neither Riemann’s conjecture nor 
its generalizations are known to be true. In fact, these are among the most 
important unsolved problems in all of mathematics. However, the analogous 
statement over global function fields was proved by A. Weil in the 1940s. 


Theorem 5.10. (The Riemann Hypothesis for Function Fields) Let K be 
a global function field whose constant field F has q elements. All the roots of 
CK(s) lie on the line R(s) = 1/2. Equivalently, the inverse roots of Lx(u) 
all have absolute value ,/q. 


Theorem 5.10 was first conjectured for hyper-elliptic function fields by 
E. Artin in his thesis, Artin [1]. The important special case when g = 1 was 
proven by H. Hasse. The first proof of the general result was published by 
Weil in 1948. Weil gave two, rather difficult, proofs of this theorem. The first 
used the geometry of algebraic surfaces and the theory of correspondences. 
The second used the theory of abelian varieties. See Weil [1] and Weil 
[2]. The whole project required revisions in the foundations of algebraic 
geometry since he needed these theories to be valid over arbitrary fields not 
just algebraically closed fields in characteristic zero. In the early seventies, 
a more elementary proof appeared due, in a special case to Stepanov, and 
in the general case to Bombieri [1]. We will give an exposition of Bombieri’s 
proof in the appendix to this book. 

Here are two simple but important consequences of the Riemann Hy- 
pothesis. 


Proposition 5.11. The number of prime divisors of degree 1 of K, ax, 
satisfies the inequality ja; — q —1| < 2g,/q. Also, (,/¢g — 1)799 < he < 


(./g + 1)?9. 
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Proof. By Theorem 5.9, L4,(0) = a;—q—1. From the above factorization of 
LK (u) we see —Li, (0) = 71 +72+--++7 2g. The first assertion is immediate 
from this and ‘Theorem 5.10. 

As for the second assertion, we have hy = Lx(1) = “9 (1 ~—1;), by 
Theorem 5.9. Now use Theorem 5.10 once again. 


Here are several qualitative consequences of this proposition. If q is big 
compared to the genus, then there must exist primes of degree one. Indeed, 
a;/q — 1 if we fix g and let q grow. Secondly, if gq > 4 we must have 
hy > 1. Also, if we fix g and let q tend to infinity then hx /q? — 1 (here, 
K is varying over global fields of fixed genus g with varying constant fields). 
Moreover, if we fix q > 4 and let g grow, then hx > oo. 

We can now present a generalization of Proposition 2.3, which, as we 
pointed out, is an analogue of the prime number theorem. 


Theorem 5.12. 
gN qe N 
an = #{P| deg(P) = N}==-+0 a 


Proof. Using the Euler product decomposition and Theorem 5.9, we see 


OO 


12, (1 — mi) 
Z =a — yo) 4 
K(u) (1 — w)(1 — qu) = He us 


Take the logarithmic derivative of both sides, multiply the result by u , 
and equate the coefficients of u% on both sides. We find 


2g 
gq +1-S ony = S— dag 
i=l d|N 


Using the Mobius inversion formula, yields 
2g N 
Nan = Drala? «0+ 9a) (Yond) 
d|N d|N i=1 


Let e(N) be —1 if N is even and 0 if N is odd. Then, as we saw in the 
proof of Proposition 2.3, 


Souda? =q% —e(N)q? +O(NG*). 
d|N 


Similarly, using the Riemann hypothesis, we see 


29 
S- u(d) (yor) < 29q? +2gNq@ . 
i=1 


d|N 
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Putting the last three equations together, we find 
Nan = q 4+ O(q™) . 
The theorem follows upon dividing both sides by N. 


Note that, in this proof, it was crucial to know the size of the zeros of the 
zeta function. The proof of Proposition 2.3 was so easy because the zeta 
function of A = F[T] has no zeros! 

We wish to derive yet another expression for the zeta function. To this 
end we consider once more the equation 


Z«(u) = ][Q-u%)™. 


d=1 


Take the logarithm of both sides and write the result as a power series 
in u using the identity —log(1 — u) = S>°°_, u™/m. The result is 


log ZK (u = ore m 


where the numbers N,, are defined by Nm = dcaim daa . These numbers 
have a very appealing geometric interpretation, which we shall explain in 
more detail later. Roughly speaking, what is going on is that the function 
field A/F is associated to a complete, non-singular curve X defined over 
F. The number N,,, is the number of rational points on X over the unique 
field extension F,,, of F of degree m. In any case, using these numbers, the 
zeta function can be given by 


ZK (wu) = exp (> “am . 


m=l1 


In the course of the proof of Theorem 5.12, we showed that 


29 
Nm = gq" +] ~Sonm 
1=1 


This equality plays an important role in the proof of the Riemann hy- 
pothesis for function fields. If we assume the Riemann hypothesis, another 
consequence is 


\Nm -— 9g” —1| < 2gq? . 


We will interpret this inequality in Chapter 8 when we discuss constant 
field extensions of function fields (see Proposition 8.18). 
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We conclude this chapter by showing how to obtain a weaker result than 
the Riemann hypothesis, which nevertheless is strong enough to give a proof 
that 


N 
an = #{P | deg(P) =N}~ =. 


Before the statement and proof of the next proposition, we need to deal 
with an important technical point. Since ¢x(s) is a rational function of 
q~*, it is a periodic function of s with period 27i/log(q). Since it has a 
pole at s = 0 and at s = 1 it has infinitely many poles on both the line 
ke(s) = 0 and the line R(s) = 1. We will be concerned with the latter line. 
From Theorem 5.9, we see that all the poles on this line are at the points 
1+ 2mmi/log(q) for m € Z. 


Proposition 5.13. Let K be a global function field. The zeta function of 
K, Cx(s), does not vanish on the line R(s) = 1. 


Proof. The proof of this proceeds, for the most part, exactly as in the case 
where K is a number field. It is based on the trigonometric inequality 


3+4cos@+cos20>0. 


The proof of this inequality consists of nothing more than noticing that 
the left hand side is 2(1 + cos@)?. 

Write s = 0 + it where o and ¢ are real. Assume that o > 1. Then a 
short calculation with the Euler product for ¢x%(s) yields 


R log¢x(s) = )> m7 NP~” cos(tlog NP”) . 
Pym 


Now, replace t with 0,t,and 2t and use the above identity to derive 
3x log Cx (o) + 4Rlog Cx (o + it) + Rlog¢x(o + 2it) > 0. 


Exponentiating, we find 


ICK(o) |? Cx (o + it)|*|Cx(o + 2it)| > 1. 


This inequality holds for 0 > 1 and all real t. Suppose ¢ is such that 
Cx(1 + it) = 0. Of course, such a t cannot be zero. It follows that ¢x%(o + 
it)/(o — 1) is bounded as ao > 1. We know that (o — 1)¢xK(c) is bounded 
as 0 —> 1 since by Theorem 5.9, ¢x(s) has a simple pole at s = 1. Finally, 
Cx(o + 2it) is bounded as o > 1 provided that t is not an odd multiple 
of 7/log(q) (see the remarks preceeding the Proposition). Assume this for 
now. Putting everything together shows that the left-hand side of the above 
inequality tends to zero as 0 — 1, which contradicts the fact that it is 
always greater than or equal to 1. 

a suppose that ¢ is an odd multiple of 27/log(q). In this case, q7U+™) = 

~1. We must show that ¢x (1+ it) = Zx(—q7!) #0. By the functional 
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equation, Z%(—q7') is related to Z,%(—1), which in turn is not zero if and 
only if Lx(—1) 4 0. To show this we must, unfortunately, use a result from 
a later chapter, namely, Theorem 8.15. 

Let F2 be a quadratic extension of the constant field F. One can form 
a new function field from K by extending the field of constants from F to 
F.. Call this new field Ky. Using Theorem 8.15 we can derive the following 
relation between Lx, (u) and Lx (u). 


Lx, (u*) = Lye(u)LRe(—u) . 


Substitute u = 1 into this relationship and use Theorem 5.9 once again. 
We find that hx, = hx Lx (—1) from which it is clear that Dx (—1) 4 0. 


Corollary. There is a real number @ < 1 such that ¢x(s) does not vanish 
in the region {s € C | Rs > O}. 


Proof. The zeta function is represented by a convergent Euler product in 
the region {s € C | #(s) > 1} and so doesn’t vanish there. By the functional 
equation (see Theorem 5.9) it doesn’t vanish in {s € C | R(s) < 0} either. 
From the Proposition it doesn’t vanish on the boundary of these regions. 

The key point that makes the function field case different from the num- 
ber field case is that ¢x(s) is a function of g~* and so it is periodic with 
period 277/ log q. Thus we may confine our search for zeros to the compact 
region {s € C|0< R(s) < 1, 0 < G(s) < 2mi/logq}. The zero set of an 
analytic function is discrete, so the number of zeros in this region is finite. 
The corollary follows immediately. 


The Riemann hypothesis for the function field case is that 6 can be taken 
to be 1/2. It is worth pointing out that nothing as strong as the above 
corollary is known to be true in the number field case. Zero free regions to 
the left of the line R(s) = 1 are known to exist, but the boundary of these 
regions approach the line as |S(s)| — oo. 

Translating the above corollary into a result about Dx(u) = 2,( — 
mu), we see that the assertion is that |m;| < q° for all 1 < i < 2g. If we 
use this estimate instead of the Riemann hypothesis and follow the steps 
of the proof of Theorem 5.12, we arrive at the following result. 


N ON 


an = #{P | deg(P) = N} = +0 (45) | 


As promised, this is good enough to show ay ~ q%/N as N —> oo, a 
result which is much weaker than Theorem 5.12, but is still very interesting. 


Exercises 


1. Suppose K/F has genus zero. For a divisor D with deg D > —1 show 
that 1(D) = deg D +1. 
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. Suppose K’/F has genus zero and that C’ is a divisor in the canonical 


class. Show [(—C’) = 3 and conclude that there is a prime P of degree 
less than or equal to 2. 


. Suppose K/F has genus zero and that there is a prime P of degree 


1. Show K = F(x) for some element x € K. 


. Suppose K/F has genus zero and that P is a prime of degree 2. 


By Exercise 1, 1(P) = 3. Let {1,2,y} be a basis for L(P). Show 
K = F(z,y). Show further that {1,2,y,27,y?,cy} Cc L(2P) and 
conclude that x and y satisfy a polynomial of degree 2 over F’. 


. Suppose that K/F has genus 1. Show that [(D) = deg D for all 


divisors D with deg D > 1. 


. Suppose K/F has genus 1 and that P is a prime of degree 1. By the 


last exercise we know |(2P) = 2 and I(3P) = 3. Let {1,2r} be a basis 
of L(2P) and {1,2,y} be a basis of L(3P). Show that kK = F(z, y). 
Show also that x and y satisfy a cubic polynomial with coefficients 
in F’ of the form 


Y*2 +a,XY +asY = X° +a9X7 + a4X +46 . 


Hint: Consider L(6P). 


. Let K/F be of positive genus and suppose there is a prime P of 


degree 1. Suppose further that [(2P) has dimension 2. Let {1,x} be 
a basis. If char F’ 4 2, show that there is an element y € K such that 
K = F(z,y) and such that x and y satisfy a polynomial equation 
of the form Y* = f(X) where f(X) is a square-free polynomial of 
degree at least three. 


. Use the Riemann-Roch theorem to show that if B and D are divi- 


sors such that B + D is in the canonical class, then |/(B) —1(D)| < 
5 | deg B) — deg(D)|. 


. Suppose P is a prime of degree 1 of a function field K/F'’. For every 


positive integer n show I((n + 1)P) —U(nP) <1. 


Let K/F be a function field of genus g > 2, and P a prime of degree 
1. For all integers k we have l(kP) < l((k + 1)P). If we restrict k to 
the range 0 < k < 2g — 2 show there are exactly g values of k where 
l(kP) = l((k+1)P). These are called Weierstrass gaps. Assume F 
has characteristic zero. If all the gaps are less than or equal to g we 
say P is a non-Weierstrass point, if not, we say P is a Weierstrass 
point. It can be shown that there are only finitely many Weierstrass 
points. In characteristic p there is a theory of Weierstrass points (due 
to H. Schmid), but the definition is somewhat different. 
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12. 
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Suppose K/F has genus 1 and that P, is a prime of degree 1, also 
called a rational point. Let E(F) denote the set of rational points. 
If P,Q € E(F), show there is a unique element R € E(f’) such that 
P+Q~R+ Po. (Recall that for two divisors A and B, A~ B 
means that A — B is a principal divisor). Denote R by P 6 Q. Show 
that (P,Q) > P@®Q makes E(F) into an abelian group with P. as 
the zero element. 


With the same assumptions as Exercise 11, map E(F’) > Cl by 
sending P to the class of P — P,,. Show that this map is an isomor- 
phism of abelian groups. 


Let K/F be a function field and o an automorphism of K, which 
leaves F fixed. If (O, P) is a prime of K, show that (cO,aP) is also 
a prime of K. Show, further, that for all a € K, we have ordgp(a) = 
ordp(a~1a). 


(Continuation). The map P — oP on primes extends to an action of 
o on divisors. If a € K*, show that o(a) = (aa). 


(Continuation). If D is a divisor of K, show a - oa induces a linear 
isomorphism from L(D) > L(oD). In particular, if o fixes D, i.e., 
oD = D, then o induces an automorphism of L(D). 


(Continuation). Suppose P is a prime of degree 1 and that oP = P. 
Then, o induces an automorphism of L((2g + 1)P). If this induced 
map is the identity, show that o is the identity automorphism. (Hint: 
Find two elements x,y € K* fixed by o such that K = F(z,y)). 


Let A be a divisor and P a prime divisor. Suppose g € L(A + P) — 
L(A). If f € L(A+P) show f/g € O,. Use this to prove (A+ P) < 
I(A) + deg(P). 


Use Exercise 17 to show 1(A) < deg(A)+1 if A is an effective divisor. 
Show further that this inequality holds in general. Thus, /(A) is finite 
for any divisor A. 


6 


Weil Differentials 
and the Canonical Class 


In the last chapter we gave some definitions and then the statement of the 
Riemann-Roch theorem for a function field K/F. In this chapter we will 
provide a proof. In the statement of the theorem an integer, g, enters which 
is called the genus of K. Also, a divisor class, C, makes an appearance, 
the canonical class of K. We will provide another interpretation of these 
concepts in terms of differentials. Thus, differentials give us the tools we 
need for the proof and, as well, lead to a deeper understanding of the 
theorem. In addition, the use of differentials will enable us to prove two 
important results: the strong approximation theorem and the Riemann- 
Hurwitz formula. The first of these will be proven in this chapter, the second 
in Chapter 7, where we will also prove the ABC conjecture in function fields 
and give some of its applications. 

We will use a notion of differential which is due to A. Weil. It is somewhat 
more abstract than the usual definition but has the advantage of requiring 
no special assumptions about the constant field. Also, it leads to very short, 
conceptual proofs of the two theorems mentioned in the last paragraph. We 
will motivate the definition by first discussing some properties of differen- 
tials on compact Riemann surfaces. If the reader is unfamiliar with this 
theory, he or she can skip directly to the definition of Weil differential in 
the purely algebraic setting. 

Let X be a compact Riemann surface of genus g, M the field of mero- 
morphic functions on X, and 2 the space of meromorphic differentials on 
X. Fix a non-zero differential w € Q and a point x € X. Let t be a lo- 
cal uniformizing parameter at x. In some neighborhood around x we can 
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express w in the following form: 


w= 3 a,t* dt. (1) 


t=—N 


If f € M,, the field of germs of meromorphic functions at x, then we 
can integrate fw around a small circle about x to get 277 Res,(fw). If 


f= Sy j=-M bjt, then 


Res, (fw) = 3 axd;. (2) 


i+j=—l 


Let wz be defined to be the C-linear map f > Res,(fw) from M, to 
C. We now look into the question of what restrictions are placed on the 
collection of linear functionals {w, | x € X} by the fact that they arise 
from a differential in the manner indicated. 

We recall the definition of the order of w at a point z, ord,(w). Write 
w locally in terms of a uniformizing parameter as in Equation 1. Then the 
order of w at x is defined to be the smallest index i such that a; 4 0. This 
number is independent of the choice of uniformizing parameter. If a_y 4 0, 
then ord,(w) = —N. It is well known that ord,(w) = 0 for all but finitely 
many points x € X and thus we can associate to w # 0 a divisor: 


(w) = S— ord, (w) a. 


rEx 


This definition will be useful as we go along. For the moment we will show 
how to characterize the number ord,(w) in a different way. Let O; C Mz 
be the ring of germs of holomorphic functions at x. Each element of O, 
has a power series expansion in terms of a uniformizing paramenter t, at x 
with all coefficients of negative index zero. O, is a discrete valuation ring. 
Its unique maximal ideal P, is generated by t,. Every non-zero fractional 
ideal of O; is a power P™ of P, where m can be any integer. With this 
notation we show— 


Lemma 6.1. Let w be a non-zero meromorphic differential, x € X, and 
Wy the linear functional on M, described above. There is an integer N such 
that w, vanishes on PN but not on PN-!. This integer is characterized by 


ord,(w) = —N. 


Proof. Since we are fixing x in our considerations we set t, = t and 
suppose w is expressed in terms of ¢ as in Equation 1. Assume a_y 4 0 so 
that ord,(w) = —N. From equation (2) it is then clear that w, vanishes on 
PN. On the other hand, t’~! € PN—! and w,(t%~!) =a_y £0. 


Corollary. w, is zero on O, but not on P>! for all but finitely many 


crEX. 
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Proof. This follows from the lemma and the fact that ord,;(w) = 0 for all 
but finitely many x € X. 


Lemma 6.1 shows that the linear functionals w, must satisfy certain van- 
ishing properties. These are all local properties only involving the knowl- 
edge of the behavior of w in the neighborhoods of points. In addition, there 
is at least one global constraint. 


Lemma 6.2. For every f € M we have 
S| we(f) = 0. 


Proof. Note that f € M implies f € M, for all x € X, so the terms in the 
sum make sense. Also, f € Oy, for all but finitely many x (on a compact 
Riemann surface a meromorphic function has at most finitely many poles). 
By the corollary to Lemma 6.1, w,(f) = 0 for all but finitely many z € X. 
Thus, the sum is finite. 

Now, fw is also a meromorphic differential. It is a well-known theorem 
that on a compact Riemann surface the sum of the residues of a meromor- 
phic differential is zero. Thus, 


We now have all the background we need to set up the notion of a Weil 
differential. Let A(X) be the subset of [|, M, consisting of elements with 
all but finitely many coordinates in O,. It is clear that A(X) is a ring 
with addition and multiplication defined coordinatewise. We will denote 
the elements of A(X) by ¢ = (fz); ie., the z-th component of ¢ is fy . 
A(X) is a vector space over C in the obvious way, ad = a(fz) = (afer). If 
w EQ, define ©: A(X) > C by 


rex 


By the corollary to Lemma 6.1 and the definition of A(X), the sum on the 
right-hand side of the above equation is finite. 

Let’s map M into A(X) by sending f to (fz), where fr, = f foralla € X. 
Clearly, M is isomorphic to its image under this map and from now on we 
identify M with its image. Lemma 6.2 can now be interpreted as asserting 
that @ vanishes in M. 

Let D = 5°, nz & be any divisor on X. We associate to D a subset of 
A(X), namely, A(D) = {(fz) € A(X) | ord (fz) > —n2, Ve € X} (we 
use the convention that ord,(0) = oo, which is greater than any integer). 
Recall that a divisor D is said to be effective, D > 0, if all its coefficients 
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are non-negative, and one divisor is bigger than another if their difference 
is effective, i.e, D < C iff C— D> 0. One then checks easily that D < C 
implies A(D) C A(C) and that J A(D) = A(X). 

With these definitions, the definition of ©, Lemma 6.1, and Lemma 6.2 
we can easily prove— 


Lemma 6.3. The functional © vanishes on both M and A((w)). Moreover, 
if w vanishes on A(D), then A(D) C A((w)). 


It is possible to show, although we shall not do so here, that if A is a 
linear functional on A(X) and A vanishes on both M and A(D) for some 
divisor D, then there is a unique differential w € Q such that @ = X. For 
the case when X is the Riemann sphere see Chevalley [1], pp. 29-30. This 
being so, in the abstract case we shall, following Weil, define differentials 
to be linear functionals on a certain space, the adele ring of the function 
field, having properties analogous to those we have seen to be true for the 
functionals w on A(X). 

For the remainder of this chapter, let K/F be a function field with con- 
stant field F’. We make no assumptions about F. Other notations will be 
the same as those in Chapter 5, except that we now introduce the new 
notation Sx for the set of prime divisors of K. 

For P € Sx let |alp = 27%?) for a 4 0 and |0|p = 0 (2 is chosen 
for convenience, any number greater than one will do). Define a metric on 
K by pp(a,b) = |a — b|p. We denote by Op and Kp the completions of 
the local ring Op and the field K with respect to this metric. We assume 
that the reader is familiar with standard facts about completions. See, for 
example, Lang [5], Chapter II . The adele ring of K is defined as 


Ak = {a0 E WES | ap € Op for all but finitely many P € sx . 
P 


The analogy between the adele ring Ax of the function field K and the 
ring A(X) which we attached to a compact Riemann surface is clear. 

We imbed K into Ax by taking x € K to (xp) where for all P, xp = z. 
Since for any element x € K, either x = 0 or ordp(z) = 0 for all but finitely 
many P, the image of x is indeed in Ax. K is isomorphic to its image and 
we identify K with its image under this map. 

If D = )/pn(P)P is a divisor of K, define Ax (D) as the set of all (xp) € 
Ax such that ordp(xp) > —n(P) for all P € Sp (notice the minus sign!). 
Then, as in the Riemann surface case, it is easy to see that D < C implies 
Ax(D) € Ax(C) and that () Ax(D) = Ax. It is also useful to notice that 
Ax (D)NAxK(C) = Ax ((D,C)) and Ax (C)+AxK(D) = Ax([C, D]), where 
(C, D) and [C, D] denote the infimum of C and D and the supremum of C 


and D, respectively. More concretely, 


ordp((C,D)) = min(ordp(C),ordp(D)) and 
ordp(|[C,D]) = max(ordp(C), ordp(D)). 
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A very important remark for our further considerations is that Ax(D)N 
K = L(D), the vector space whose dimension over F’, |(D), is the focus of 
interest in the Riemann-Roch theorem. This equality follows directly from 
the definitions of Ax(D), L(D), and the way K is imbedded in Ax (D). 

We note that Ax and the subsets Ax(D) are all vector spaces over F 
under the obvious operation; for a € F’, a(xp) = (arp). With all these 
definitions in place, we can now define a Weil differential. 


Weil Differential. An F-linear map w from Ax to F is called a Weil 
differential if it vanishes on K and on Ax(D) for some divisor D. We 
denote the set of Weil differentials on K by Qx and the set of all Weil 
differentials which vanish on AxK(D) by QK(D). 


A number of remarks are in order. To begin with, many authors define a 
somewhat smaller ring than the adele ring, namely, the ring of repartitions, 
and define Weil differentials using it. The advantage is that one avoids going 
to the completion at all the primes P of K. While this is more elementary, 
some of the proofs become more difficult. In particular, the proof of the 
Riemann-Hurwitz formula is more transparant using the full ring of adeles 
and this is the principal reason we have used adeles in the above definition. 

It is usual to define a topology on Ax by declaring the subsets Ax (D) 
to be the open neighborhoods of the identity (the adele, all of whose coor- 
dinates are zero). We can then say that a Weil differential is a continuous 
F-linear functional on Ax which vanishes on K. We will not, however, 
make much use of topological considerations. 

Ax is a vector space over K and all the sets Ax(D) are vector spaces 
over Ff’ as we have seen. x also can be made into a vector space over K 
by means of the following definition. Let € € Ax and x € K. Define 


(w)(§) = w(x). 


It is clear that zw is an F-linear functional on Ax and that it vanishes 
on K. It requires but a short calculation to see that w € Qx%(D) implies 
rw € Ox ((2) + D). Thus, cw is a Weil differential. 

From now on we will refer to the elements of Qx simply as differen- 
tials rather than Weil differentials. We will show (Proposition 6.7) that the 
spaces {17%(D) are finite dimensional over F. Before doing that, we need 
some important preliminary material. In particular, we need Riemann’s 
inequality, the precursor to the Riemann-Roch theorem. 


Lemma 6.4. Let D<C be divisors of K. Then, 
dim pr Ax(C)/AK(D) = deg C’ — deg D. 


Proof. If C = D the result is clear. Otherwise, C' is obtained from D by 
adding finitely many primes, so it suffices to show that 


dimpr Ax(D + P)/AxK(D) = deg P 
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for any prime P. 
Let P = POp. Let n = ordpD. If € = (ap) € Ax(D+ P), then 


ordp(ap) > —n—1 which is the same as ap € P-"-1. Now map Ax(D+P) 
to P~"—!/P-” by taking € = (ap) to the coset of ap modulo P~”. This 
is clearly an epimorphism and from the definitions the kernel is seen to be 


Ax(D). Thus, 
Ax(D + P)/Ag(D) & P-"71/P™ = P-"" /P™ = Op/P. 
All these isomorphisms preserve F'-vector space structure. Since 
dimr Op/P = deg P, 
the result follows. 
Lemma 6.5. Let D<C be divisors of K. Then, 
Ax(C)+K 
* An(D)+K 


Proof. Recall that Ax(C)N K = L(C). Using the first and second laws of 
isomorphism, the space on the left-hand side of the above equation is seen 
to be isomorphic to 

AK(C) og Ax(C) / Ax(D) 


Ax(D)+L(C)  (Ax(D)+L(C)) / Ax(D) 

Using the first law once again, we see that (Ax(D)+L(C)) / Ax(D) = 
L(C)/L(D). Thus, 
Ak(C)+K 
Ax«(D)+K 


Using Lemma 6.4, the right-hand side is equal to (degC — deg D)) — 
(1(C’) — l(D)) = (deg C — I(C)) — (deg D — l(D)) as asserted. 
Corollary. For a divisor D, define r(D) = deg D —1(D). If D < C, then 
r(D) < r(C). 
Proof. This is an immediate consequence of the Lemma since the dimen- 
sion of a vector space is a non-negative integer. 


dim = (deg C' — 1(C’)) — (deg D —I(D)). 


dimpr = dimpr Ax(C)/Ax(D) —dimp L(C’)/L(D). 


Since both deg D and /(D) only depend on the linear equivalence class 
of D, the same is true of r(D). We will use this remark in a moment. 


Theorem 6.6. (Riemann’s Theorem) Let K/F be an algebraic function 
field with field of constants F. There is a unique integer g > 0 with the 
following two properties. For all divisors D, we have l(D) > deg D—g +1. 
Also, there is a constant c such that for all divisors D with deg D > c 
we have I(D) = deg D—g +1. (g will turn out to be the constant in the 
Riemann-Roch theorem, i.e., the genus of K). 
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Proof. Choose an element x € K*—F*. Then, K/F (2) is a finite extension 
of degree n, say. Let B = (2). be the divisor of poles of x. The primes P 
which occur in the support of B are the only ones for which ordp(z) < 0. 
By Proposition 5.1, deg B = |K : F(x)| =n. 

Consider the integral closure, R, of F[z| in K. If p € R, the only poles of 
p are among the poles of x. Thus, p € L(moB) for some positive integer ™mo. 
It is a standard fact that we can find a basis {1, p2,..-, Pn} for K/F (x) 
such that p; € R for 1 <i < n. Choose a positive integer mo such that 
pi © L(moB) for all 1 <1 <n. For any integer m > mo the elements t py 
with 0 < 7 < m— mo and 1 <i <n are all in L(mB) and are linearly 
independent over F’. We conclude from this that 


(mB) > n(m— mo +1). 
It follows that for any m > mo we have 
r(mB) = degmB —I(mB) < mn —n(m—mo+1) =nmo-7n. 


This shows that r(mB), which is an increasing sequence by the Corollary 
to Lemma 6.5, is bounded above and so must remain constant from some 
point on. Call this maximum value g — 1. Since O < mB, —1 = 7r(O) < 
r(mB) < g—1. It follows that g > 0. 

Let D be any divisor. Write —D = D,; + D2 where the support of D, 
is disjoint from the support of B and the support of D2 is a subset of 
the support of B. Let P be in the support of D,. Then, F'[z] C Op and 
P1 Fx] = (g(x)) where g(x) is a monic, irreducible polynomial. It follows 
that for some positive integer h, (g(x)")+ Dj, has no pole at P. Multiplying 
together the polynomials of this type at each P in the support of D; and 
we wind up with a polynomial f(x) with the property that (f(x)) + D, 
only has poles among those of x. The same is true of Do and so, the same 
is true of (f(x)) — D. It follows that there is a positive integer m such that 


(f(2)) -D+mB>O. 


By the corollary to Lemma 6.5, we deduce r(D) < r((f(z)) + mB) = 
r(mB). It follows that r(D) < g —1 for all divisors D. From the definition 
of r(D), this is equivalent to 


l(D) >degD-—g+1, 


which concludes the proof of Riemann’s inequality. 

We now have to produce a constant c such that 1(D) = degD—g+1 
whenever deg D > c. Let m, be a positive integer large enough so that 
r(m,B) = 9-1. Define c = m,n+g. If D is a divisor with deg D > c, then 
by Riemann’s inequality we find 


l(D—m,B) > deg(D—m,B)-—g+1>1. 
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It follows that there is a y € K* such that (y) + D—m,B>O orm B < 
D+(y). Once again invoking the corollary to Lemma 6.5, we find g —1 = 
r(m,B) < r(D). We have already shown that for all divisors D, r(D) < 
g —1. Thus, r(D) = g — 1, which is the same as [(D) = deg D—g +1. 


The constant ¢ can, in fact, be taken to be 2g — 1. This follows from the 
full Riemann-Roch theorem, as we saw in the last chapter. 

The Riemann-Roch theorem replaces the Riemann inequality with an 
equation. The following proposition is an approximation to what we want. 


Proposition 6.7. For any divisor D of K, the space Nx (D) is finite 
dimensional over F and 


l(D) = deg D—g+1+dimpOx(D). 


Proof. In Lemma 6.5, we are going to fix D and let C’ vary over divisors 
greater than or equal to D. By Riemann’s theorem I(C) > degC’ —g+1 
or what is the same degC — I(C’) < g — 1. So, by Lemma 6.5 


Ax(C)+K — 


dim 

The second part of Riemann’s theorem asserts that there is a constant c 

such that equality holds for all divisors C with degC’ > c. Let Co be any 
divisor greater than or equal to D and with degree bigger than c. Then, 


Axk(C)+K 


dim Fr 
for all divisors C’ bigger than C4. It follows that Ax (C)+K = Ax(C.)+K 
for all C > C,. However, it is easily seen that for any adele € there is a 
divisor C > C, such that € € Ax (C). Thus, Ax(C.) + K = Ax and we 


have shown 


Ax 


To finish the proof one has only to notice that Q«(D) is the F-dual of 
the vector space Ax /(Ax(D)+ K). 


Corollary 1. Let c be the constant in Riemann’s theorem. Then if D is a 
divisor with deg D > c, we have Ak = Ax(D)+ K. 


Proof. We have just shown that dimp(Ax / Ax(D)+K) = 1(D)—deg D+ 
g — 1, which is zero if deg D > c by Riemann’s theorem. Thus Ax = 
Ax(D)+K in this case. 


Corollary 2. The genus of K, g, can be characterized as the dimension 
over F of the space Q (0). 
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Proof. The zero divisor, 0, has degree zero and dimension 1. From the 
proposition we derive 1 =0—g+1+dimprx(0). This gives the result. 


The interested reader can easily show that if on a compact Riemann 
surface a meromorphic differential w is such that @ is zero on A(0), then w 
has no poles and conversely. Thus the space 0. (0) is the analogue of the 
space of holomorphic differentials and this appellation is sometimes used 
even in the abstract case. 

We have now given the promised characterization of the genus in terms 
of differentials. The next task is to give an interpretation of the canonical 
class. To do so we have to show how to assign a divisor to a non-zero 
differential. Since we don’t have (yet) local expressions for a differential at 
a point as in the classical case, we proceed by, in essence, using the result 
of Lemma 6.3 as a definition. That is, if w € Qx, we want to define (w) 
as the largest divisor D such that w vanishes on Ax(D). First we need to 
show there is such a divisor. 


Lemma 6.8. Let w € DK be a non-zero differential. Then, there is a 
unique divisor D with the property that w vanishes on Ax(D) and if D’ is 
any divisor such that w vanishes on Ax(D'), then D’ < D. 


Proof. Let 7 = {D’ | w(Ax(D’)) = 0}. Since w is a differential, 7 is non- 
empty. By Corollary 1 to Proposition 6.7, we see that deg D’ < c for all 
D' €T, since w 4 0. Let D be a divisor of maximal degree in 7. We claim 
that D has the desired properties. Clearly, w vanishes on Ax (D). Suppose w 
vanishes on Ax (D’). Then w vanishes on Ax (D)+AxK(D’) = Ax([D, D’}); 
i.e., [D,D’| € T. Since deg [D, D’) > deg D, it follows that the degrees 
must be equal and so [D, D’] = D, which implies D’ < D as required. The 
uniqueness is clear. 


We now define the divisor of a differential w to be the unique divisor D 
with the properties stated in the Lemma. We use the notation (w) for the 
divisor of w. 


Lemma 6.9. LetwEQK anda € K*. Then, 
(cw) = (x) + (¥). 


Proof. Suppose w € 0x%(D). If € € Ax, then zw vanishes on € if x€ € 
Ax(D), which is equivalent to € € Ax((z) + D). Thus, w vanishes on 
Ax(D) implies zw vanishes on Ax ((x) + D). The converse also holds as 
one can see by observing that w = x~1(rw). Thus, w vanishes on Ax (D) if 
and only if rw vanishes on Ax((xz) + D) and the result follows easily from 
this. 


In the classical case of compact Riemann surfaces 2(.X) is one dimen- 
sional over the field of meromorphic functions on X. To see this, let w,w’ € 
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Q(X) with w £0. Suppose w = f(t)dt and w’ = g(t)dt in a neighborhood 
U ofa point x € X. Here, f(t) and g(t) are Laurent series in a uniformizing 
parameter ¢ about xz. Then hy(t) = g(t)/f(t) is a meromorphic function on 
U which is well defined in that it is independent of the choice of uniformiz- 
ing parameter. This follows easily by use of the chain rule. These functions 
hy fit together to give a meromorphic function h on X and w’ = hw. 

A similar proof cannot be given in the abstract case, but nevertheless an 
analogous result is true. This will follow by the use of Riemann’s theorem 
(once again) together with some elementary linear algebra. 


Proposition 6.10. The space of Weil differentials, Nx, is of dimension 
one when considered as a vector space over K. 


Proof. Let 0 Aw € Nx and x € L((w) — D) where D is some divisor. We 
claim that zw € Qx(D). By the proof of the previous Lemma, we know 
that rw vanishes on Ax ((x) + (w)). Since x € L((w) — D) we have 


(rz) + (w) 2 -((w) - D) +) =D, 


and so xw vanishes on Ax(D). This establishes the claim. 

Now let w,w’ € Ox be non-zero differentials. By the previous paragraph 
we see that L((w)—D)w and L((w’)—D)w’ are both F-subspaces of 0x (D). 
If we could show that these subspaces have a non-zero intersection, the 
proposition would follow immediately. The idea of the proof is to force this 
to happen by a suitable choice of D. 

Let P be any prime, and set D = —nP, where n is large and positive 
(how large will be determined shortly). By Proposition 6.7, 


dimr Q(—nP) = l(-nP)+ndegP+9-1=ndegP +g —1. 


Recall L(—nP) = (0) since any element in it would have no pole but would 
have a zero at P. 
Using Riemann’s inequality we find 


dimp L((w) + nP) > deg(w) + ndeg P—g +1, 
and similarly for w’. Thus, 


dimr L((w)+nP)w + dimp L((w’) +nP)w' 
> 2ndeg P + deg(w) + deg(w’) — 2g + 2. 


It follows that for large enough n the sum of the dimensions of the two 
subspaces L((w) + nP)w and L((w’) + nP)w’ exceeds the dimension of 
the ambient space Q%(—nP). By linear algebra, they must have a non- 
zero intersection. Thus, there exist z,y € K* such that zw = yw’ and so, 
w = ry hw. 

Corollary 1. Let 0 #w € Ox and let D be a divisor. Then there is an 
F-linear isomorphism between L((w) — D) and Ox (D). 
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Proof. In the proof of the proposition we showed that L((w) — D)w 
Q%¢(D). So it just remains to show that this inclusion is an equality. Let 
w' € Qx(D). By the proposition, there is an element x € K such that 
w’ = zw. Since w’ vanishes on Ax(D) we must have D < (w’) = (x) + (w) 
by Lemma 6.9. Thus, (x) > D — (w) = —((w) — D); ie., e € L((w) — D). 


Corollary 2. All the divisors of non-zero differentials fill out a single 
divisor class. This class is called the canonical class of K. 


Proof. If w,w’ € Qx are non-zero, there exists an x € K* such that 
w’ = tw by the proposition. By Lemma 6.9 we have (w’) = (x) + (w) so 
that (w’) and (w) are in the same class. Conversely, if D is in the class of 
(w), D = (x) + (w) for some x € K*. Thus, D = (rw), the divisor of a 
differential. 


Proof of the Riemann-Roch Theorem. By Corollaries 1 and 2 to 
Proposition 6.10 and Proposition 6.7, we find 


l(D) = deg D—g+1+4+1((w) —D). 


This is the assertion of the Riemann-Roch theorem given in the last chapter, 
Theorem 5.4. We see that the divisor C in the statement of that theorem 
can be taken to be any divisor of a non-zero differential. We now have a 
complete proof of the Riemann-Roch theorem! 


Using Theorem 5.4 and its corollaries, we see that the constant c in the 
statement of Riemann’s theorem can be taken to be 2g — 1 and for any 
differential w the degree of (w) is 2g — 2. 


Finally, we want to decompose a differential into a sum of local pieces 
analogous to the sum of the residues construction in the classical case. To 
this end, let’s define a map ip: Kp > Ax. If up € Kp let ip(xp) be the 
adele with all components zero except the P-th component which is equal 
to xp. Clearly, ip is an F vector space isomorphism of Kp with its image. 

Let w € Ox. We define wp € Homp(Kp, F) by wp(rp) = w(ip(xp)). 
This process associates a family of local functionals {wp | P € Sx} to a dif 
ferential. Knowing this family, we would like to reconstruct the differential 
and its divisor. 

The functionals wp are not arbitrary. They must vanish on some power 
of the maximal ideal P Cc Op. Indeed, wp(rp) = 0 if ip(xp) € Ax((w)) 
and this inclusion holds if ordp(zp) > —ordp(w). Thus, wp vanishes on 
P-°erdp() | This shows the functionals wp are continuous in the P-adic 
topology. 


Proposition 6.11. Let w € QK and € = (xp) € Ax. Then, for all but 
finitely many P we have wp(xp) = 0 and 


= SS wp(xp). 
P 
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Proof. Let S be the finite set of primes where either ordp(w) < 0 or 
ordp(xp) < 0. If P ¢ S then xp € Op and so wp(rp) = 0 by the remark 
preceding the Proposition. Define a new adele &’ whose P’th component is 
rp ifP ¢ Sand Oif Pe S. Then, €’ € Ax((w)) and € = €'+ Si peg ip(Zp). 
Thus, 


w(€) = w(f’) + S° w(ip(xp)) = S> wp(ap) = YS wp(zp). 
P 


Pes Pes 


The next proposition provides the abstract analogue of Lemma 6.1. It 
enables one to recover the divisor of w from properties of the local func- 
tionals wp. This will be very useful in the proof of the Riemann-Hurwitz 
formula. 


Proposition 6.12. Let0 Aw € Ox. Then, N = ordp(w) is determined 
by the ung two properties; wp vanishes on P-% but does not vanish 


on P-N 


Proof. We have already seen in the remarks preceding Proposition 6.10 
that wp vanishes on P-°r4P), It remains to show that w p doesn’t vanish 
on P-ordp(~)-1 We know, from Lemma 6.8 and the definition, that w does 
not vanish on Ax ((w) + P). Let € € Ax((w) + P) be such that w(é) 4 0. 
As usual, write € = (zg) with Q varying over all primes. By Proposition 
6.11, 
0 #u(€) =wp(tp)+ > we(zq) = wp(xp). 
QAP 


The last equality follows from the fact that ordg((w) + P) = ordg(w) for 
Q#P. 

Since € € Ax ((w) +P), we must have zp € P~4?™)-1, This concludes 
the proof. 


Corollary. A differential w is completely determined by any local compo- 
nent wp. That is, ifw,w' €CQNK andwp = wp thenw =u". 


Proof. If wp = wp then (w — w’)p = 0. The proposition shows that if 
Ww —w’ were a non-zero differential no local component could be the zero 
map. Thus, w —w’ =O;ie.,w =w’. 


We have now accomplished all the goals set out for this chapter except 
the statement and proof of the strong approximation theorem. This im- 
portant theorem, strictly speaking, has nothing to do with differentials. 
However, its proof is an easy consequence of material developed earlier, 
namely, Corollary 1 to Proposition 6.7. 

Let’s first recall a version of the weak approximation theorem. Suppose 
K is a field and O1,O9,...,O; a collection of subrings of K which are 
discrete valuation rings with quotient field K. Let P; C O; be the maximal 
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ideal of O;. Finally, suppose we are given a set of elements a; € K and a set 
of positive integers n; with i varying from 1 to t. The weak approximation 
theorem asserts that there is an element a € K such that ordp (a—a;) > nj 
for i = 1,2,...,t. The proof, which is not hard, can be found in many 
sources, e.g. see Lang [5]. The strong approximation theorem in function 
fields is an assertion of similar type, but with much greater constraints on 
the element a € K. 


Theorem 6.13. Let K/F be a function field and S C Sx a finite set of 
primes. For each P € S let an element ap € Kp and a positive integer np 
be given. Finally, let’s specify a prime Q ¢ S. Then, there is an element 
a€K such that ordp(a —ap) > np for all P € S and ordp(a) > 0 for all 


P ¢ SU{Q}. 


This theorem is called the strong approximation theorem. Before begin- 
ning the proof, two remarks are in order. First, the added generality of 
choosing the ap € Kp is very small. If we prove the theorem with the 
ap € K, then the full theorem takes just a trivial extra step. The main 
point is that in addition to the conditions at the primes in S we have added 
the infinitely many conditions that the element a be integral at all primes 
not in S with the one exception of Q. 


Proof. Define an adele € = (xp) by the conditions that rp = ap for PE S 
and rp = 0 for P ¢ S. Next, define a divisor D = mQ — dipegnp P. 
Choose the integer m so large that the degree of D exceeds the constant 
cin Riemann’s theorem. Then, by Corollary 1 to Proposition 6.7, we have 
Ax =K+Ax(D). In particular, € =a+7 wherea€ K and 7 € Ax(D). 
In other words, € —a € Ax(D). Examining this relation, component by 
component, shows that a has the desired properties. 


Exercises 


1. Let w be a meromorphic differential on a compact Riemann surface 
X. Show that w is zero on A(O) if and only if w has no poles. 


2. Let M be the field of meromorphic functions on a compact Riemann 
surface X and 22 the space of meromorphic differentials on X. Show 
in detail that Q is a one-dimensional vector space over M. 


3. Show directly (i.e., arguing only with differentials) that dimQ%(D) = 
0 if deg D > 2g — 2. 


4. Suppose that D is a divisor of degree zero, but that D is not principal. 
Show dimp Qx(D) =g-1. 


5. If D is a divisor, and deg D < g — 1, show that dimrp Qx(D) > 0. 
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10. 


11. 


12. 


13. 


14. 


15. 
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. Suppose that w € 2%(O) and has a zero P of degree 1 and that 


ordpw > g. Show that P is a Weierstrass point (see Exercise 10 of 
Chapter 5). 


. In this and the following two exercises, we assume that fF is alge- 


braically closed. Let P be a prime. Assume the genus g is greater 
than 0. Show Q,%(P) is properly contained in Q,« (OQ). 


. (Continuation) Suppose g > 1 and 0 < n < g. Show there exist primes 


{P,, P.,...,P,} with the property, dim0Q,(P,+Po+--:-+P,) = g9—-n. 


. (Continuation) Suppose g > 1. Show there are primes {P,, P2,...,P,} 


such that P; + Po +---+ P, is not the polar divisor of any element 
of K*. 


Suppose w, and w2 are two Weil differentials with the same divisor. 
Show w, = aw, for some a € F*. 


Let o be an automorphism of K which leaves F’ fixed. Let P be a 
prime of K and oP the prime obtained by applying a to P (see 
Exercise 13 of Chapter 5). Show that o extends to an isomorphism 
of K p with K, p. Show further that o induces an automorphism of 
Ax which is F-linear and maps K to itself. 


(Continuation) If w is a Weil differential, define cw: Ax + Ax by 
the equation ow(a) = w(o~'a) for all a € Ax. Show that ow is a 
differential. 


(Continuation) Let D be a divisor of K. If w € ONx(D) , show that 
ow €ONK(cD). 


(Continuation) From the last exercise we see that o induces an au- 
tomorphism of 0Q,(O). If F' is algebraically closed and g > 1, show 
there is a differential of the first kind w such that o(w) = (w). 


(Continuation) Assume F' is algebraically closed and that the genus 
g of K is > 2. Show there is an integer k with 1 < k < 2g — 2 and 
a prime P such that o* leaves P fixed. (This series of exercises was 
inspired by the paper of Iwasawa and Tamagawa [1], where it was 
proved that the automorphism group of a function field of genus 2 or 
greater is finite. In characteristic zero this result is due to A. Hurwitz. 
In charactersitic p the first proof was given by H. Schmid [1]). 


i 


Extensions of Function Fields, 
Riemann-Hurwitz, and the 


ABC ‘Theorem 


Having developed all the basic material we will need about function fields 
we now proceed to discuss extensions of function fields. This material can 
be presented in a geometric fashion. Function fields correspond to algebraic 
curves and finite extensions of function fields correspond to ramified covers 
of curves. In this chapter, however, we will continue to use a more arithmetic 
point of view which emphasizes the analogy of function fields with algebraic 
number fields. 

Let K/F be a function field with constant field F and let L be a finite 
algebraic extension of K. Let FE be the algebraic closure of F in L. It is 
then clear that L is a function field with EF as its field of constants. Recall 
that in this book, a “function field” over F refers to a field which is finitely 
generated over F' and of transcendence degree one. If L = EK, we say that 
L is a constant field extension of K. We will discuss such extensions in 
detail in the next chapter. If = F, we say that L is a geometric extension 
of K. In the general case, we have a tower K C EK C L, where EK/K is 
a constant field extension, and L/EK is a geometric extension. 

Let p denote the characteristic of F’. In the characteristic zero case, all 
extensions are separable and this considerably simplifies the theory. Since 
we will be especially interested in the case where the constant field F is 
finite, we must also deal with the theory when p > 0 and thus with questions 
of inseparability. Instead of working in complete generality we will often 
compromise by assuming that F' is perfect, i.e., that all algebraic extensions 
of F are separable. This holds if F has characteristic zero or is algebraically 
closed or is finite. These cover all the cases of interest in this book. 
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This chapter falls naturally into three parts. In the first part we recall 
some basic facts about extensions of discrete, rank one valuations and, also, 
the theory of the different and its application to questions of ramification. 
Here we will assume the reader is somewhat familiar with this material so 
that the proofs will only be sketched. In the second part we will discuss 
how differentials behave in extensions. This will lead to the proof of the 
Riemann-Hurwitz theorem, one of the most important and useful theorems 
in the subject. Finally, we will discuss the so-called ABC-conjecture of 
Oesterlé-Masser and give a very simple proof in function fields using an 
idea the author learned from W. Fulton. Several applications of this result 
will be given, e.g., a proof of Fermat’s last theorem for polynomial rings. 


Let L/K be a finite algebraic extension of fields. We will use the abbre- 
viation “dvr” for a discrete valuation ring. Let Op be a dvr in K having 
K as its quotient field. Denote its maximal ideal by P. Let Og be a dvr 
in L with maximal ideal $B. We say that Og lies above Op or that P lies 
above P if Op = KN Og and P = NM Op. The notation P|P for this 
relation is often useful. There are two integers associated to this situation, 
f = f(/P), the relative degree, and e = e(8/P), the ramification index. 
To define f, note that Og/ is a vector space over Op/P. The relative 
degree is defined to be the dimension of this vector space. We shall see 
shortly that it is finite. Next, POg is a non-zero ideal of Og contained 
in $B. Thus, POs = §B° for some integer e > 1. This integer is called the 
ramification index. It is easy to see that e is characterized by the following 
condition; for all a € K, ordg(a) = e ordp(a). 

The ramification index and the relative degree behave transitively in 
towers. More precisely, let K C L C M be a tower of function fields with 
L/K and M/L finite, algebraic extensions. If $8 is a prime of M and p and 
P are the primes lying below $ in L and K respectively, then, e(B/P) = 
e(B/p)e(p/P) and f(8/P) = f(P/p) f(p/P). Both relations follow easily 


from the definitions. 


Proposition 7.1. With the above notations, ef <n =|L: K], the dimen- 
sion of L over K. 


Proof. Let II be a generator of $8 and choose w1,w2,...,Wm such that their 
reductions modulo $B are linearly independent over Op/P. We will show 
that the em elements w,IIJ with 1 < i < mand 0 < j < e are linearly 
independent over K. This is sufficient to establish the result. 

Suppose 


e—l ™ 


So So aijwill = 0 


j=0 1=1 


is a linear dependence relation over K. If the a;; € K are not all zero we 
can assume they are all in Op and at least one of them is not in P (since 
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K is the quotient field of Op and Op is a dvr). Consider the elements 


mm 


A; = Ajj Wy. 


i=1 


If some aj; ¢ P, then A; is a unit in Og since its reduction modulo 
is not zero. Otherwise, A; is divisible by 7, the generator of P, and so 
ordy(A;) > e. Thus, ords (S75 A,Il’) = j, for some jo < e. This is a 


* ae . —| ; 
contradiction since }°_5 A;IP’ = 0. 


If we assume L/K is a finite and separable extension, then we can 
construct all the $8 lying over P as follows. Let Op be as above, and 
let R be the integral closure of Op in L. R is a Dedekind domain. Let 
PR = p{'ps?...pg’ be the prime decomposition of PR in R. The set 
{p1,P2,...,P g} is the complete set of non-zero prime ideals of R. For 
each 2, the localization Ry, is a discrete valuation ring with maximal ideal 
B;, = piRy,. Define Op, = Ry,. Then {Op,,Op,,...,Op,} is the com- 
plete set of dvrs in L lying above Op. Let f; and e; be the relative degree 
and ramification index of $8; over P;. By standard properties of localiza- 
tion, the exponents in the decomposition of PR are indeed the same as the 
ramification indices defined earlier. 


Proposition 7.2. Assume L/K is a finite, separable extension of fields. 
Then, with the above notations, )-7_, ef; =n =[L: K]. 


Proof. Since L/K is separable, the trace from L to K, try x, is a non- 
trivial K-linear functional on L. Using this, one can prove that R is a free 
module over Op of rank equal to n = [L : K| (see Samuel and Zariski [1]). 
Thus, R/PR is a vector space over Op/P of dimension n. 

Now, PR = pj'p5?...pg’ and so, by the Chinese Remainder Theorem, 


R/PR& R/p? ® R/p? © ++ @ B/pe. 


Again, by standard properties of localization, for each index 7 we have a 
ring isomorphism 
R/p;" > Ox, /B5". 
The latter ring is a vector space over Op/P, and we calculate its dimen- 
sion using the filtration 


POg, = Bi CPE" C+ CPi C Op,, 


Since $8; is principal, the successive quotients are one dimensional over 
Ox, /3B;. This ring is f; dimensional over Op/P (by definition). Thus, the 
total dimension of Os, /SBf* over Op/P is e; fi. 

Summing over i gives n = )>7_, ex f; as asserted. 


Having dealt with the separable case we now prove a simple fact about 
the purely inseparable case. 
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Lemma 7.3. Let L/K be a purely inseparable extension of degree p, the 
characteristic of K. Assume K = L? (this strange assumption is often 
correct in function fields). Suppose Op C K is a dur with quotient field K. 
Then there is one and only one dur Om C L above Op. Moreover, e = p 
and f=1soef=p=[L: K). : 


Proof. Let R= {réL |r? € Op} and P= {reL |r? € P}. It is easy 
to see that R is a ring, B is a prime ideal in R, and BOOp = P. We will 
show that F is a dvr. 

Let 7 be a generator of P. Since L? = K, there is an element II € L with 
II? = 7. Clearly, II € $8. We claim that every element t € L is a power of 
II times a unit in R. Once this is proved, it is almost immediate that II 
generates $B and that RF is a dvr. 

Now, t? € K so that t? = u7® where u is a unit in Op and s € Z. Thus, 
(t/II*)? = u which shows that t/II° € R. Since (II*/t)? = u7* € Op it 
follows that II°/t € R as well. Thus, t is a power of II times a unit as 
claimed. 

If Og C L is any other dvr lying over Op, let t be one of its elements. 
Then t? € K M1 Og = Op so that t € R. We have shown Ogy C R. Since, 
as we shall show in a moment, dvrs are maximal subrings of their quotient 
fields, we have R = Og, which establishes uniqueness. 

To prove the maximality property of dvrs, let O C K be a dvr with 
quotient field K and uniformizing parameter 7. Let O’ be a subring of K 
containing O. Suppose there is an element r € O’ with r ¢ O. Then, there is 
a unit u € O such that r = ux—” with n > 0. Then, 77! = u7!r™~!r € O' 
and it follows that all powers of 7, both positive and negative, are in O’. 
Since every element of K is equal to a unit of O times a power of 7, we 
conclude that if O 4 O’, then O’ = K. 

Finally, ordg(7) = p so e = p. By Proposition 7.1, ef < p, and it follows 
that f = 1, as asserted. 


A field F is called perfect if every algebraic extension is separable. ‘This 
is automatic in characteristic zero. In characteristic p > 0, it is well known 
that F is perfect if and only if F = F?. We use this criterion in the next 
proposition. 


Proposition 7.4. Let F be a perfect field of positive characteristic p, and 
K a function field with constant field F. Then, |K : K?] = p. 


Proof. Let x be an element of K not in F. Then [K : F(x)] < oo. Consider 

F(x)? = F?(x?) = F(x? ). It is clear that [F'(x) : F(x”)] = p. For example, 

one shows easily that {1, 2, 27,...,2?~!} isa field basis for F'(x) over F(x”). 
Thus, the proposition follows from the equation 


(A: P(a?)| = [Ks F(x) |[F (a) : F(e?)| = [Ks KPA? : F(a?)), 
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if we can show [K : F(x)] = [K? : F(x?)]. To show this, let {wy, w2,...,wWm} 
be a field basis for K over F(x). We claim that {w?.w5,...,w?,} is a field 
basis for A? over F'(x?). This is a straightforward calculation. 


Corollary. Let K be a function field of characteristic p > 0 with perfect 
constant field F. Let L be a purely inseparable extension of K of degree p. 
Then, F is the constant field of L and LP=K. 


Proof. Suppose a € L is a constant. By definition, it is algebraic over F’. 
Since L/K is purely inseparable of degree p, a? € K and is algebraic over 
F’, This implies a? € F. Since F = F? there is a G € F with a? = 6? which 
impliesa= Be F. 

Applying the proposition to L we see [[ : L?] = p. However, since L/k 
is purely inseparable, L? C K. It follows that [K : L?] =1 and so K = L?. 


Proposition 7.5. Let K be a function field with a perfect constant field F’. 
Let L be a finite extension of K, and M, the mazimal separable extension 
of K in L. Then, the genus of M is equal to the genus of L. Also, for 
each prime p of M there is a unique prime % in L lying above it. Finally, 
e(B/p) = [L: M] and f(P/p) = 1. 


Proof. The constant field E of M is perfect since it is a finite extension of 
F’, which is perfect by assumption. 
Since L/M is purely inseparable, there is a tower of fields 


KCM=K) CK, C:::C Kyn-1 C Kn = ZL, 


where for each 1 > 1, K;/K;—1 is purely inseparable of degree p. By the 
corollary to Proposition 7.4, and an obvious induction, we have K;_, = K? 
for each 1 <i <n. Raising to the p-th power is thus an isomorphism from 
Kk; to K;~1, which shows that all these fields have the same genus. This 
proves the first assertion. 

The remaining part of the Proposition is proven by induction using the 
corollary to Proposition 7.4, Lemma 7.3, and the fact that both the relative 
degree and ramification index are multiplicative in towers. 


We are now in a position to prove the theorem at which we have been 
alming. 


Theorem 7.6. Let K be a function field with perfect constant field F’. Let 
L be a finite extension of K of degree n. Suppose P is a prime of K and 
{B1, Bo,..., PB} the set of primes in L lying above P. Then, >-7_, ex fi =n 
where, as usual, e; is the ramification index and f; the relative degree of 


BB; over P. 


Proof. Let M be the maximal separable extension of K in L. Let p; be 
the prime of M lying below 8; and let e{ and f! be the ramification index 
and relative degree of p; over P. By Proposition 7.5, 3; is the unique 
prime of L lying over p;. By Proposition 7.2, the theorem is true for M/K. 
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Thus, 5-7_, ej, f/ = [M : K]. Invoking Proposition 7.5 once more we see 


e(L : M] =e; and f! = f;. Substituting into the sum and noticing that 
[L: kK] =[L: M]|M : K] finishes the proof. 


When L/K is a finite extension of function fields, the conclusion of Theo- 
rem 7.6 holds without any assumption about the constant fields (see Cheval- 
ley [1] or Stichtenoth [1]). The method uses results which are specific to 
function fields, e.g., the degree of the zero divisor of a function and Rie- 
mann’s inequality. We have chosen a different route, which seems more 
natural, but at the expense of having to assume the constant field is per- 
fect. 

Recall the notation Dx and D, for the divisor groups of K and L, 
respectively. We introduce homomorphisms Nz/;% and iz,/x% as follows: 


1. NrjK : Dy > Dx is defined by Nz/x% (PB) = f(P/P)P for all primes 
%3 € S; and then extended by linearity. Here, P is the prime of K lying 
below $B, i.e, P= BOK . Nz 7x is called the norm map on divisors. 


2. ink : Dk — Dz is defined by iz/xK(P) = digipe(B/P) P for all 
P € Sx and then extended by linearity. iz/% is called the extension of 
divisors map, or, sometimes, the conorm map. 


A simple consequence of these definitions and Theorem 7.6 is that Nz,/x0 
in/K is the map “multiplication by [L : K]” on Dx. Thus, iz/% is one to 
one (which is obvious anyway) and the quotient group Dx /Nz/K(DL) is 
annihilated by [LZ : K] (which is perhaps not so obvious). 

It is important to determine how these maps interact with the degree 
maps. Suppose that F and E are the constant fields of K and L, respec- 
tively. Recall that for a prime % of L, deg, ($8) is the dimension of On /B 
over £. Similarly, for a prime P of K, degx(P) is the dimension of Op/P 
over F’. These degree maps are then extended by linearity to Dz and Dx. 


Proposition 7.7. Let le Dz, and A € Dx. Then 


Lik 
degy Nix (2) =[E: F]deg, & and deg, (tz/K(A)) = Bow ee A. 


Proof. Both facts follow from the calculation 
[Op /B : F) = [Op /P : E\lB : F] = (Op/P : Op/P\lOp/P : F), 


which shows that [F: F| deg, % = f(B/P) deg, P. 

To show the first assertion, we see it is sufficient to do it when 2 
$8 is a prime divisor. In that case, degy Nr/K (PB) = degx f(P/P) P 
f(B/P) degx P= |E: F) deg, P. 

To show the second assertion, it is again sufficient to consider the case 
where A = P is a prime. Then, 


| 
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deg, tnjK(P) = » e(P/P) deg, P 
$B | P 
ER DL (B/P)IB/P) dee x P 
BP 


| 


The result is now immediate from Theorem 7.6. 


We also would like to investigate how these two maps behave on the 
group of principal divisors. Recall that if a € K* its divisor is defined to 
be (a)K = )/pordp(a)P, where the sum is over all P € Sx. Similarly, one 
defines the principal divisor (b); for an element b € L*. 


Proposition 7.8. 
(i). Ifa e K*, then ip;«(a)K = (a)p. 


(ti). Ifbe€ L*, then Nyx (b)z = (NzK(0))x. 


Proof. To prove the first assertion, one simply computes 


in/K(@)K = tL/K S  ordp(a) P= S— ordp(a) S- e(B/P)B 
P P 93] P 


=) e(P/P)ordp(a) $ = S~ ordy(a) PB = (a)z. 
7 p 


The proof of the second assertion is somewhat more difficult, but stan- 
dard. It follows from general properties of Dedekind domains. A particularly 
elegant proof is given in Serre [2]. A more conventional treatment is given 
in Samuel and Zariski [1]. 


Corollary. The maps iz;~% and Ny; /K induce homomorphisms on the class 
groups Clk and Cl, (which we will designate by the same letters). 


Proof. The proposition shows iz/~% maps Px — Py, and so induces a map 
from Clk = Dk /PK > Cl, = Dz, /P_. Similarly for Ny/K: 


The next topic to consider is that of ramification. Let L/K be a finite 
extension of function fields, suppose 8 is a prime of L lying over a prime P 
of kK. We say that $B is unramified over P if two conditions hold: e(/P) = 
1 and the extension of residue class fields is separable. If either condition 
is not satisfied, we say P is ramified over 8. In a separable extension 
of function fields, only finitely many primes are ramified. This important 
result is a consequence of the theory of the different, which we will now 
sketch without complete proofs. A detailed treatment can be found in the 
above cited references, Serre [2] and Samuel and Zariski [1]. 
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We begin with considerations of some generality. Let L/K be a separable 
extension of fields, A C K a discrete valuation ring with quotient field K, 
and B the integral closure of A in L. One can show that B is a Dedekind 
domain with only finitely many prime ideals. These are the prime ideals 
which occur in the prime decomposition of PB, where P is the maximal 
ideal of A. As an A module, B is a finitely generated free module over 
A of rank equal to [L : K]. Also, the trace of any element of B lies in 
A. Let {21,22,...,2n} be an A basis for B and let 0g/,4 be the ideal in 
A generated by det(trz;x(x;z;)). This ideal is called the discriminant of 
B/A. It is independent of the choice of a basis. Since L/K is separable, the 
discriminant is not the zero ideal. Let 


PB= {I BF 
B.|P 


be the prime decomposition of PB and consider the A/P algebra 
B/PB = B/Py' 6 B/PY G--- dS B/Py:. 


A commutative algebra over a field k is said to be separable if it is a direct 
sum of separable field extensions of k. From general theory, B/PB is a 
separable A/P algebra if and only if det(trg,,4(Z:%;)) 4 0. Here the bar 
refers to reduction modulo PB and we have set B = B/PB and A = A/f. 
It follows easily that every prime in B is unramified over A if and only 
if 03/4 = A, in other words, B is unramified over A if and only if the 
discriminant is all of A. 

Define C'p;4 = {u € L | trz/% (xb) € A, Vb € B}. This set is easily seen 
to be a B submodule of L. In fact, it is the largest B submodule of L whose 
trace is contained in A. We shall show it is a fractional B ideal. Notice that 
BC Cpa. Cpa is called the inverse different of B over A. By definition, 
DBsa = CBA C B is called the different of B over A. 

Since, C'g/4 is a B-module, to show it is a fractional ideal it suffices 
to produce a non-zero element d of L such that dCgja © B. Set d = 
det(trz/%(xi2;)), the element we used in defining the discriminant. If c € 
CBA, then 


Tr 
c= ) ra; r, EK. 
i=1 


Multiply both sides by x; and take the trace. We get 


nr 
trp /K (crs) = Sori trp /K (Li2;). 
i=1 
It follows from Cramer’s rule that dr; € A for all i, and so dCg JA GB 
as asserted. This argument tells us a bit more. Since Cg JAG d~-'B we 
must have 0g/4B =dB C Dg/4q; ie., the discriminant is contained in the 
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different. The connection between the different and discriminant is even 
closer as we see from the following proposition. 


Proposition 7.9. i) Let A be a dur with maximal ideal P, K its quotient 
field, L a finite separable extension, and B the integral closure of A in L. 
Then, some prime above P in B is ramified if and only if0B/4 C P. 

ii) NujkDB/a =%B/a. In words, the norm of the different is the discrim- 
nant. 


Proof. We have already given the proof of 7) in the above discussion. For 
the proof of part iz) see Serre [2]. 


We will say that B/A is unramified if no prime of B is ramified over 
A. From the above proposition, it follows that if Dg/;4 = B, then B/A is 
unramified. Much more is true, however. A prime 8 of B is ramified over 
A if and only if $ divides Dg 4. The easiest way to see this is to pass to 
completions. . 

For each prime 8 C B lying over P consider the completion Ly of L at 
‘3. The closure of K in Lx is isomorphic to K p, the completion of K at 
P. We also complete A at P and B at 8 to obtain the rings Ap C By. It 
is not hard to show that Bog is the integral closure of Ap in Lg. This local 
situation has all the ingredients of the “semi-local” situation considered 
above, so in exactly the same way we can define the local discriminant and 
the local different, 0B/a(P) C Ap and DB a(P) Cc By. 


Lemma 7.10. We have 0g/4(P) = Vp/aAp and Dp/a(B) = Dp/aBy. 
In other words, if 0pg;4 = P*, then dg/a(P) = Pt and if 6 is the exact 
power of B dividing Dg, then Dp a(P) = 135 

For the proof of this result we refer the reader to Serre [2], Chapter 3. 


Corollary 1. As in the Lemma, let 6 be the exact power of % dividing 
D pia. Then 6 can be characterized as the largest integer m such that the 


trace from Lp to Kp of G-m 1s contained in Ap. 


Proof. From the definition, ifm has the property described in the corollary, 
the local inverse different is $~™ and so Dg 4(P) = P™. The result is 
then immediate from the lemma. 


Corollary 2. With the same notation as Corollary 1, 6 > e($3/P)—1 with 
equality holding if and only if the characteristic of F is either zero or does 


not divide e(B/P). 


Proof. (sketch) Neither e(9$8/P) nor 6 changes after passing to the com- 
pletion (for 6 this follows from the lemma). So, we can assume A and 
B are complete. Again, nothing essential changes if we replace K by the 
maximal unramified extension of K in L and A by its integral closure 
in this extension. We can thus assume is totally ramified over P. Set 
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= e(%/P) = [L: K]. Let m € $ be a uniformizing parameter, and 
f(x) = Sf, ai’ be the monic irreducible polynomial for 7 over K. f(z) 
is an Eisenstein polynomial (see Serre [2], Chapter 3) and in particular, 
a; € P for 0 <i <e. Under these circumstances, {1,7,...,7°~'} is a basis 
for B over A and one can show that Dg 4 = (f'(7)). Now, 


f'(r) =er® 4+ (e- L)ae_1n** tees tay, 


Every term in the sum except possibly the first is divisible by 7° and the 
first is divisible by 7°—!. The first assertion of Corollary 2 follows from this. 
The first term of the sum is exactly divisible by 7°~* if and only if either 
the characteristic of F is zero or does not divide e. This proves the second 
assertion. 


Let L/K be a finite extension of function fields, 8 a prime of L and P 
the prime lying below it in K. We say that {8 is tamely ramified over P if 
it is ramified and either the characteristic of the residue class field On /$ 
is zero or does not divide e($8/P). The second assertion of Corollary 2 
can then be reworded to assert that for a tamely ramified prime ‘PB, the 
exponent to which it divides the different is e(®/P) — 1. 


Theorem 7.11. With the above notations and hypotheses, a prime 3B of 
B is ramified over A if and only if B | Deva. 


Proof. (sketch) The definition of unramifiedness is in two parts; the ram- 
ification index must be one, and the residue class field extension must be 
separable. If one ignores the second condition one can refer to the above 
Corollary 2 to Lemma 7.10 for a proof of the theorem. We proceed some- 
what differently and handle both conditions at once. 

By standard properties of localization and completion, a prime B of B 
is ramified over P if and only if % is ramified over P. So we can work in 
the local situation Boy /Ap. The advantage here is that By has only one 
prime ideal, namely, 1. Thus, using what we know about discriminants, Dt 
is ramified over P if and only if dz jA(P) # Ap. By Proposition 7.9, applied 


to this situation, we see this is true if and only if Dg /4(P) F Bay. Finally, 
by Lemma 7.10 this last condition holds if and only if Dg , is divisible 


by $B. 


We have now developed enough theory to enable us to return to function 
fields. We will define the different divisor and explore its properties. From 
now on, we suppose L/K is a finite separable extension of function fields 
with EF the constant field of L and F' the constant field of K. It is easy to 
see that E’/F is also a finite, separable extension. For any prime P of K we 
let Rp be the integral closure of Op in L. As we have seen, the primes of 
L lying above P are in one to one correspondence with the non-zero prime 
ideals p of Rp. If p is such an ideal let Og be the localization of Rp at p 
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and, of course, the maximal ideal of Og is % = pOg. Note that the pair 
Rp, Op is playing the role of the pair B, A in the above considerations. 

For any prime % of L, let p = Rp and let 6(98) be the exact power 
of p dividing the different of Rp over Op. We define the different divisor 
of L/K as follows: 


Dijk = > HP) P. 


BEesy 


Actually, we must prove that all but finitely many 6(8) are zero before we 
can be sure this definition makes sense. By our previous work, a prime 8 
is unramified over P if and only if 6($8) = 0. Once we prove the finiteness 
assertion, it will follow that only finitely many primes in DL can be ramified 
over K. 

To prove the finiteness result, let {21,22,...,¢,} be a basis of L over K. 
Let S C Sx be the set of primes of K which lie below a pole of some 7;. 
Since there are only finitely many such poles, the set Sis finite. For P ¢ S 
we have xz; € Rp for 1 <i <n. This follows from the fact that Rp is the 
intersection of the valuation rings containing it (Rp is a Dedekind ring and 
has quotient field L). Let Cp be the inverse different of Rp over Op. Let 
c € Cp and write c= )>;"_, a,x; with a; € K. Then, for all 1 <j <n, 


try/;K(cx,) = Sastry (wix,). 


i=1 


Using Cramer’s rule, as we have previously, we find da; € Op forl1 <i <n, 
where d = det(trz/x(x;r;)). Thus, dCp C Rp and so Rp C Cp Cd7'! Rp. 
Now, ordy(d) = 0 for all but finitely many $8 and, consequently, d is a unit 
in Rp for all but finitely many P € Sx. This implies Cp = Rp and so 
Drp/Op = Rp for all but finitely many P. The fact that 6(§8) = 0 for all 
but finitely many %B is now clear. 

We summarize our discussion of the different in the following theorem. 


Theorem 7.12. Suppose L/K is a finite separable extension of function 
fields. The different Dy; x defined above is a divisor with the property that 
a prime $B of L is ramified over K if and only if it occurs in Dy; K with 
a non-zero coefficient. In particular, only finitely many primes of L are 
ramified over K. 


We note that separability is crucial for the last assertion of the theorem. 
If Fis perfect and L/K is purely inseparable of degree p, then Lemma 7.3 
shows that every prime of L is ramified over K. 

Because it is often useful, we record an important property of differents. 
The proof is not hard, but will be omitted. Suppose K C L C M is a tower 
of function fields with M/K finite and separable. Then, 


DuyjK = Duji+imyt(Pr/k) 
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The next topic will be the behavior of differentials in field extensions. 
Let w be a Weil differential of a function field K and suppose L is a finite 
separable extension of K. We want to associate to w a differential w* of L. 
After this is done the next task will be to relate the divisor of w* to the 
divisor of w. 

We begin by extending of the trace map from L to K to a trace map 
from the adeles of L, Az, to the adeles of K, Ax. The key to this is the 
following important isomorphism 


L@xK Kp ~C— Lp . 
B|P 


The map involved can be described quite easily. Identifying D as a subfield 
of Ley and Kp as a subfield of Lx there is an obvious K-bilinear map 
from L x Kp to Lg, namely, (¢,a) — fa. Thus, for each |P, there is a 
map from L @x Kp > Ly. Now pass to the direct sum. The fact that the 
resulting homomorphism is an isomorphism is given in Serre [2], Chapter 
3. Note that L is embedded diagonally into the right-hand side. 

Both sides of the above isomorphism are Kp algebras and the isomor- 
phism respects this structure. If {71,22,...,2%,} is a basis for D/K, then 
{x1 ® 1,22 @1,..., Ln @ 1} is a basis for the left-hand side over Kp. On 
the other hand, choosing a basis for Ly over Kp for each %B|P and putting 
these together gives a basis for the right-hand side. Using these bases en- 
able one to prove the following result, which connects the global and local 
traces and norms. 


Proposition 7.13. Let Ti; and Ny denote the trace and norm from Lop 
to Kp, respectively. Then, forx € L we have 


trp;K(@) = S_ Tx(2) and = Nr/K(2) = [| Noy (ax) . 
BP BP 


In words, this says that the global trace is the sum of the local traces 
and the global norm is the product of the local norms. We will be primarily 
concerned with the traces. 

We now define the trace map from Az to Ax. Let a = (ag) be an 
element of Az. We map it to the adele of K, whose P-th coordinate, ap, 
is ) pp Ip(aq). Since for all B, Tr : Ox — Op we see that for all but 


finitely many P, ap is in fact in Op. Thus, the image of our map is indeed 
in Ax. We call this trace map trz;% because it extends the trace map on 
the level of fields. To see this, recall L is embedded diagonally into Ay. 
If A € Az is the adele all of whose coordinates are equal to @, then, by 
Proposition 7.13, trz;%(A) is the adele of K all of whose coordinates are 
equal to trz/x(£). One also checks easily that trz;~ is an F-linear map 
from A; to Ax (recall that F is the constant field of K). 
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Having extended the trace map, it is now relatively easy to define the 
map from QR — Qy,, which we need. Let w € Qe. Define w* to be the 
compositum w otrz/%, which is an F’-linear homomorphism from Az to F’. 
From now on we assume that F is also the constant field of L, i.e., that L/K 
is a geometric extension. We claim that w” is, in fact, a Weil differential 
of L. ‘That it vanishes on L follows from what we have just proved; trz/x 
maps L to K and w vanishes on K by definition. It remains to prove that 
w™ vanishes on A;,(C) for some divisor C of L. 

Since w € OQ there is a divisor C of K such that w vanishes on 


Ax(C) = {(ap) € Ax | ordp(ap) > —ordpC, VP € Sx}. 


Let a = (as) € Ay. Fix a prime P of K and suppose {%1, Bo,..., B,} are 
the primes of L lying above P. We need to ascertain the conditions which 


force 
g 


ordp(>— Tx, (ag,)) > —ordpC . 
i=1 

This will follow if for each 7 individually ordp(T, (ag,)) > —ordpC. Let 
m7 be a uniformizing parameter at P and for simplicity set m = ordpC. The 
last condition is equivalent to the following: for each i, ordp(Ts, (7am, )) > 
0. This will happen if tas, is in the local inverse different at %8;. From 
the definition of the different and Corollary 1 to Lemma 7.10 this condition 
is equivalent to 


ordy, (7’"asg,) > —6(B;) or ordgy, (ag,) > —6(8;) — e(B;/P)ordpC . 


It is easy to check from the definitions that 


S| (6(B) + e(8/P)ordpC) PB = Dr + izsKC. 
p 


To sum up, we have proven— 


Proposition 7.14. Let L/K be a finite, separable, geometric extension of 
function fields and w a non-zero differential of K. The w* =w otrrsK is a 
differential of L. In more detail, if w vanishes on Ax (C), w* is an F-linear 
homomorphism from Ay — F which vanishes on L and on Az(izsKC + 
Dy K). 


We would like to determine the divisor of w*. Recall, by definition, this 
is the largest L-divisor B such that w* vanishes on A,(B). In light of the 


previous proposition, a good guess would be iz/%(w)« + Dr;x. This is, in 
fact, correct. 


Theorem 7.15. With the hypotheses and notation of Proposition 7.14, we 
have 


(w")n = tnyK()K + Disk. 
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Proof. We are going to use Proposition 6.12 of the last chapter, which 
shows how to determine the divisor of a differential using properties of its 
local components. 

We begin by recalling the definition. Let up € Qr. For a prime $ of L let 
Igy : Ly —> A; be the map that takes an element y € Lg to the adele all of 
whose components are zero except the §8-th component which is y. Then 
Lg = [0 tp. 

Now, wy = (wotrrsK) om = wo(try/K om) = wo(tpoTy) = wpolg. 
The third equality follows directly from the definition of the trace map on 
the adeles. In words, the local component of w* at $B is the local component 
of w at P composed with the local trace map. 

According to Proposition 6.11, ordy(w*) = N, where N is the integer NV 
such that wa vanishes on B- N but not on g-N- - 

Wy, vanishes on mQ-N if and only if wp vanishes on Tx (B-N ), which oc- 
curs if and only if Ts, (p-N) C P-™, where m = ordp(w). This is equivalent 
to Tx (P™B-N ne Op, which in turn is true if and only if sye(B/P)m—N C 
- 5(B) by Lemma 7.10 and the definition of the different. We conclude 
that wy, vanishes on G- if and only if N < e(8/P)ordp(w) + 6($). The 
largest N with this property is clearly the right-hand side of this inequality. 
The theorem follows. 


It might be asked, what is the necessity of Proposition 7.14 since Theorem 
7.15 is a more accurate result? The answer is we had to show wu” is a 
differential before we could decompose it into its local components and use 
Proposition 6.11 to determine its divisor. 

We are finally in a position to prove the Riemann-Hurwitz Theorem, one 
of the main goals of this chapter. 


Theorem 7.16. (Riemann-Hurwitz) Let L/K be a finite, separable, geo- 
metric extension of function fields. Then, 


2g, -2= [L K](29x — 2) + deg, Dyk: 


In particular, 


29, — 2 >[L: K](29x — 2) + > (e(P/P) - 1) deg, P. 
p 


where the sum is over all primes % of L, which are ramified in L/K. 
The inequality is an equality if and only if all ramified primes are tamely 
ramified. 


Proof. Let w be a non-zero differential of K. By the remarks following 
Corollary 2 to Lemma 6.10, (w)x is in the canonical class of K. By Corollary 
3 to Theorem 5.4, every divisor in the canonical class of K has degree 
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29x — 2. Thus, deg (w)K = 29x — 2. Similarly, deg, (w*), = 291 — 2. 
From Theorem 7.15 we find 


2gr — 2 = deg, (w*)t = degy tn;x(w)K + deg, Dix. 


From Proposition 7.7 we see that deg, iz;K(w)x = [L: K]degx(w)x = 
[L : K\(2gxK — 2). We have used the assumption that L has the same 
constant field as K. Substituting into the above equality yields the first 
assertion of the theorem. 

The second assertion is an immediate consequence of Corollary 2 to 
Lemma 7.10, since deg; Dz/K = Dig (PB) degy B. 


The Riemann-Hurwitz theorem has a very large number of consequences. 
We will give some idea of how it is used by giving three corollaries. 


Corollary 1. Suppose L/K is a finite, separable, geometric extension of 
function fields. Then, gx < gz. (This need not be true for inseparable 
extensions!) 


Proof. Since the different is an effective divisor (all its coefficients are non- 
negative) we see 2g; —2 > [L: K|(2gxK — 2) > 29x — 2. Thus gx < gr as 
asserted. 

One can prove this result in another way. It follows immediately from 
the theorem that w* is a holomorphic differential (no poles) if w is a holo- 
morphic differential. One checks (using the fact that the trace map from 
Ay to Ax is onto when L/K is separable) that the map w — w” is a one 
to one F-linear map from 2 (0) to 0,(0). Since these two vector spaces 
have dimension gx and g,, respectively, it follows that gx < gr. 


Corollary 2. (Luroth’s Theorem) Let L = F(x) be a rational function field 
over F and K a subfield properly containing F. Then, there isaucEe K 
such that K = F(u). 


Proof. Since K properly contains F’, it is easy to see that [L : K] < ow. 
Let M be the maximal separable extension of K contained in L. If the 
characteristic of F' is zero, then M = L. If the characteristic of F’ is p > 0, 
then L/M is purely inseparable of degree p” for some n > 0. It follows that 
aP” € M. On the other hand, the polar divisor of z in L is a prime (the 
prime at infinity) of degree 1 and so the polar divisor of x?” has degree p”. 
Consequently, [L : F(x”")] = p” by Proposition 5.1. Thus, M = F(z?’ ) 
and M is a rational function field. This shows that we can assume to begin 
with that L/K is separable. Since L has genus zero it follows by Corollary 
1 that gx = 0. Since L has a prime of degree 1, e.g., the zero or pole of z, 
the prime lying below it in K must also have degree 1. It follows that K 
is a rational function field (see the discussion after the proof of Corollary 
5 to Theorem 5.4). 
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Corollary 3. Let L/K be a finite, separable, geometric extension of func- 
tion fields. Assume gr = 1. Then, gx <1 with equality holding if and only 
if L/K is unramified. 


Proof. The inequality gx < 1 follows from Corollary 1. From gz = 1 and 
the theorem we deduce, 0 = [L: K](2gx — 2) + deg, Dz x. If gx = 1 the 
degree of the different is zero and so the different is the zero divisor (recall 
that the different is an effective divisor). From Theorem 7.12 it follows 
that L/K is unramified. By the same theorem, if L/K is unramified then 
DrjK = 90 and so 29x — 2 = 0 or, what is the same, gx = 1. 


We will conclude this chapter with a beautiful application of the Riemann- 
Hurwitz theorem to the proof of the ABC theorem in function fields. Let’s 
begin by recalling the ABC conjecture of Masser and Oesterlé in the case 
of the rational numbers Q. Suppose A,B,C € Z and that A+ B=C. 
Suppose further that the three integers A, B, and C are pairwise relatively 
prime. The conjecture states that for each « > 0 there is a constant M, 
such that if A,B, and C satisfy the given conditions, we have 


max (|A|,|B|,|C|) < Me ( [] »)***. 
p| ABC 


This elegant conjecture has many surprisingly powerful consequences. See 
Lang [4], Chapter IV, Section 7, for a discussion and a number of references. 
At present the conjecture is not proven and many people consider it to be 
beyond the range of the available methods. 

The ABC conjecture for Q can be easily generalized to number fields. 
We omit this formulation here. Instead we reformulate the conjecture over 
Q slightly. In this new formulation it becomes clear what the analogous 
conjecture should be in the function field case. 

Rewrite A+ B=C as A/C + B/C =1. Writeu = A/C and v= B/C. 
Then u,v € Q and u+v = 1. Let’s recall the definition of the height of 
a rational number r. Write r = m/n where m,n € Z and (m,n) = 1. 
Then the height of r, ht(r), is defined to be max(log|m|, log |n|). With 
this notation we can recast the ABC conjecture as follows. Suppose € > 0 
is given. Then there is a constant m, such that whenever u,v € Q* and 
u+v=1, we have 


max(ht(u), ht(v)) <m,.+ (1 +.) S- log p. 
p|ABC 


Here, A and B represent the numerators of u and v and C’ their common 
denominator. 

Now, let’s return to the function field case. Let K be a function field 
and F its field of constants. Suppose u,v € K* and w+vu = 1. We need 
a substitute for the notion of height. Let A be the zero divisor of u and 
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C its polar divisor. A good measure of the size of a divisor is its degree, 
so it is natural to define the height of u to be max(deg A, degC’). This is 
fine, but it can be stated more simply. We know (Proposition 5.1) that 
deg A = degC = [K : F(u)]. Instead of calling this number the height of 
u, it is more conventional to call it the degree of u and use the notation 
deg u. One should be careful though, the degree of the divisor (u) is zero 
(Proposition 5.1), whereas the degree of the element u is greater than or 
equal to zero and is zero only when it is a constant. 

For those with some algebraic geometry background, the degree of an 
element has a nice geometric interpretation. The field K is the function 
field of a smooth, complete curve I’ defined over F’. The element u can 
be thought of as a rational map from I to the projective line P!/F. The 
degree of u is the degree of this mapping. If F’ is algebraically closed, all 
fibers have deg u elements with (possibly) finitely many exceptions. 

Before stating the next theorem, we need two more definitions. 

If D € Dx is a divisor, recall that Supp(D) is defined to be the set of 
primes which occur in D with non-zero coefficient. This set is called the 
support of D. 

Secondly, suppose u € K™* is not a constant. Let M be the maximal 
separable extension of F'(u) inside of K. Then, the field degree, [M : F(u)|, 
is called the separable degree of u and is denoted by deg, u. Note that 
deg, u < degu with equality holding if and only if K/F'(u) is separable. 

We can now state and prove(!) the ABC conjecture for function fields. 


Theorem 7.17. Let K be a function field with a perfect constant field FP. 
Suppose u,v © K* andu+v=1. Then, 


deg,u=deg,u < 29K —2+4+ S- degy P. 
PéSupp(A+B+C) 


Here, A and B are the zero divisors of u and v in K, respectively, and C is 
thetr common polar divisor in K. (Note that no “e” appears in the function 
field version). 


Proof. It is convenient to set k = F(u). We first treat the case that K/k 
is separable and do the general case later. Let n = degu = [K : k]. The 
Riemann-Hurwitz theorem implies that 


29K —-2>—-2n+ S_(e(P/P) —1)deg, P, (1) 


where the sum is over all primes of K and for any such prime P, 8 denotes 
the prime of k lying below P. The point is that since F’ is perfect a prime 
is ramified if and only if its ramification index is greater than one. If its 
ramification index is equal to one, it doesn’t contribute to the sum. 

In the function field k = Fu) we consider three primes Bo, %B1, and 
Boo, which are the zero divisors in k of u, 1—u = v, and 1/u, respectively. 


It is easy to see that A = ix;,(Bo), B = ix/,(P1), and C = is, (Poo). 
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(Thinking of u as a mapping from the curve I to P!, this says that A, B, 
and C’ are the inverse images (as divisors) of 0, 1, and oo). In the above 
sum we are only going to consider primes in the support of either A, B, 
or C’. This will only strengthen the inequality. Consider the sum only over 
the primes in the support of A. We have 


(e(P/Bo) — 1) degx P = deg x (ix Bo) — S- deg x P. 


PéSupp(A) PeéSupp(A) 


By Proposition 7.7, degx(ix/~ Bo) = [K : k] deg, Bo = n. So, the above 
sum is simply n minus the sum of deg, P over those P in the support 
of A. The same considerations prove the analogous result for B and C. 
So, adding the contributions from these three sums and substituting into 
Equation 1 above, we find 


29K —2>n- S- deg, P , 
PéeSupp(A+B+C) 


and that concludes the proof in the case where K/k is separable. 

Now suppose the characteristic p of F is positive and that K/k is insep- 
arable. Let M be the maximal separable extension of k in K. Then K/M 
is purely inseparable of degree p™ for some m. Working with the separable 
extension M/k, we find 


29m —2>[M :k] - Ss” degy, P’ , 
P’e€Supp(A’+ B’+C") 


where A’ = tyy/x(Bo), B’ = ingen ($B1), and C’ = inz~ (Boo). By Proposition 
7.5, we see that for each prime P’ of M there is one and only one prime 
P of K lying above P’ and that deg, P = deg, P’. Since, by definition, 
[M : k] = deg, u, the above inequality can be rewritten as 


29m — 2 => deg, u— s~ deg, P. 
PéSupp(A+B+C) 


Invoking Proposition 7.5 once more, we see gy = gx. This completes the 
proof. 


To show the power of the ABC Theorem, we will give two applications. 
The first will concern solutions to the Fermat equation XN +YN = 1 
in function fields and the second will be the statement and proof of the 
S-unit Theorem, a powerful result with many applications to diophantine 
problems. 


Proposition 7.18. Let K be a function field with a perfect constant field 
F’, Consider the equation XN+YN = 1. We assume that N is not divisible 
by the characteristic p of F. If gx =0 and N > 38, then there is no non- 
constant solution to this equation in K. If gx >1 and N > 69x — 3, then 
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there is no non-constant solution to this equation in K. (By a non-constant 
solution we mean a pair (u,v) € K* — F? such that uN +v% = 1.) 


Proof. Suppose that (u,v) € K? is a non-constant solution. Invoking the 
ABC theorem we find 


max(deg, u ,deg, u™) < 29gn —2+4+ S- degy P, (2) 
PéeSupp(A+B+C) 


where A is the zero divisor of u, B is the zero divisor of v, and C is their 
common polar divisor. We’ll return to this equation in a moment. 

Let M be the maximal separable extension of F(u) in K. By considering 
the tower of fields F(u%) C F(u) C M C K, and noting that F(u)/F(u) 
is separable of degree N (it’s here we use the hypothesis (p,N) = 1), we 
see that deg, uN = N deg, u. Similarly, deg, uv" = N deg, v. 

Next, by comparing the zero divisor of u in M to the zero divisor of 
u in K (as we have done in the proof of Theorem 7.17) we see that 
> PeSupp(A) degy P < deg, u. Applying the same reasoning to v yields 
> PcSupp( B) deg, P < deg,v. Since C is the common polar divisor of u 
and v we have a similar inequality involving C. 

Putting all this together and substituting into Equation 2 yields 


N S- degxr P< 29K -2+ S— degr P , 
PeéeSupp(A) PéeSupp(A+B+C) 


with similar equations involving B and C on the left-hand side. Adding all 
three and rearranging terms gives 


(N — 3) S- deg, P < 69x —6. 
PéeSupp(A+B+C) 


If gx = 0 and N > 3, the left-hand side of this inequality is non-negative 
and the right-hand side is —6. This is impossible and this contradiction 
establishes the first assertion of the Proposition. 

Assume now that gx > 1 and N > 6g, — 3. Then certainly N > 4 so 
N —3 is positive. Dividing both sides of the inequality by N —3 we see that 
6gx —6 / N—3 must be bigger than or equal to one. If 6gx —-6 / N-3 <1 
we get a contradiction. Since this inequality is equivalent to N > 69x — 3 
the proposition is proved. 


Actually, one can get a somewhat better result by a different method. 
Namely, suppose all the hypotheses of the proposition hold and that (u,v) € 
K? is a non-constant solution. Let F(u, v) be the subfield of K generated by 
u and v over F’. We will show in the next chapter that in characteristic zero 
or when (p, N’) = 1 the genus of Fu, v) is equal to (N —1)(N —2)/2. When 
the constant field is perfect the genus of a subfield is less than or equal to 
the genus of the field. Thus, if a solution exists (N — 1)(N — 2)/2 < gx. 
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Put the other way around, if gx < (N —1)(N — 2)/2, there are no non- 
constant solutions. Although this is quite elegant, the solution using the 
ABC Theorem is applicable in many situations where this method fails. 

By the way, the hypothesis about the constant field being perfect is 
superfluous. In the next chapter we will show that in this problem we could 
have replaced F by its algebraic closure F’. Since algebraically closed fields 
are perfect the method applies and gives the result over F’ and a posteriori 
over F’, 

The final result of the chapter involves the notion of S-units. Let K be 
a function field with constant field F and suppose S = {P,, Po,...,P:} 
is a finite set of primes of AK. An element u € K™ is called an S-unit if 
Supp(u) C S, i.e., only primes in S enter into the principal divisor (u). The 
S-units form a group denoted by Us. The map u — (u) is a homomorphism 
from the S-units into the free abelian group of divisors supported on S. 
Every element in the kernel of this map has zero for its divisor. Thus, 
the kernel consists precisely of the constants F*. The degree of a principal 
divisor is zero. Thus the image of this map is a subgroup of the divisors 
of degree zero supported on S. The latter group is free of rank t — 1. We 
have shown that Us/F”™* is free of rank < t — 1 where t is the number of 
elements in S. This tells us something about the multiplicative properties 
of S-units. The next theorem is about an additive property of S-units. 


Theorem 7.19. Let K be a function field with a perfect constant field F. 
Let S be a finite set of primes of K. Then, there are only finitely many pairs 
of separable, non-constant S-units (u,v) such thatu+v=1. (u is said to 
be separable if the field extension K/F(u) is separable). If the characteristic 
of F is zero, then every solution is separable. If characteristic of F is p > 0, 
then the most general solution to X + Y = 1 in non-constant S-units is 
(uP, uv?” ) where (u,v) is a separable, non-constant solution in S-units and 
meZ,m> 0. 


Proof. Assume to begin with that (u,v) is a non-constant, separable solu- 
tion to X + Y = 1 is S-units. By the ABC Theorem we have 


degu < 2gxK —2+ 3 degy P , 
PéSupp(A+B+C) 


with the usual notations. Let M = S$) pe, degx P. Then, the right-hand 
side is < 29% — 2+ M since the supports of A,B, and C are in S. 
Let A = )opega(P) P with each a(P) > 0. Then, degu = degy A = 
> pes a(P) degy P. This shows that for each P € S, 


a(P) degx P < degyx A < 29x -2+M, 


and consequently that a(P) is bounded. Since A is a divisor with support 
in a finite set of primes with bounded coefficients, A must be one of only 
finitely many divisors. Similarly for B and C. It follows that the number 
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of possibilities for the principal divisor (u) is finite and similarly for (wv). 
For each of these possible principal divisors choose an S-unit u; and v;. We 
suppose that 1 <i <land1< 7 < k. Since any two non-zero elements 
of K have the same divisor if and only if they differ by a constant, all 
the non-constant, separable, S-unit solutions to X + Y = 1 have the form 
(au;, Gv;) with a, @ € F. If there are more than /k such solutions, then by 
the pidgeon hole principal we can find a given pair of indices (2,7) and two 
distinct pairs of constants (a, 3), (a’, 6’) such that 


au; + Gv; =1 and a’ujt Gv; =1. 


Subtracting these two equations, we find that u,; is a constant times v;. 
Substituting into the first equation shows that u; is a constant. This is a 
contradiction, so we have shown there are only finitely many non-constant, 
separable, S-unit solutions to X + Y = 1. 

Now suppose u and v are non-constant S-units and u+v = 1. If wu is not 
separable, let M be the maximal separable extension of F(u) in kK. Then, 
[kK : M] = p™ for some positive integer m. By the corollary to Proposition 
7.4 we see that u and v = 1 — u are p-th powers. Write u = uj and v =v} 
with u,,v, € K. Note that, in fact, uj,v, € Ug. Since p is the characteristic 
of K,l=utv=uptol? =1 = (u+v1)? which implies uj + v1 = 1. 
If u; is separable, we are done. If not, repeat the process and we find two 
S-units ug and v2 such that uy = ud, vy = vb, and ug + v2 = 1. Note that 
U= ub and v = ve Thus, if wa is separable we are done. If not, continue 
the process. This must end in finitely many steps since a non-constant in 
K cannot be a p™ power for infinitely many m. This is easy to see. For 
example, if u is not a constant, let P be a prime which is a zero of u. If u 
is a p™ power then p™ divides ordp(u) which bounds m. The proof is now 
complete. 


Corollary. Suppose K is a function field over a finite field F. Suppose N 
is greater than 3 and is relatively prime to the characteristic of F. Then, 
XN +Y% =1 has at most finitely many non-constant separable solutions 


ink. 


Proof. Suppose (u,v) € K? is a non-constant solution. In the course of 
proving Proposition 7.18, we proved that 


(N — 3) S- deg, P< 69K -6, 
PéSupp(A+B+C) 


where A is the zero divisor of u, B is the zero divisor of v, and C' is their 
common polar divisor. Assuming N > 4, this shows that for any prime P 
in the support of either u or v we must have deg, P < (69x —6) / N—3. 
In a function field over a finite field there are at most finitely many primes 
whose degree is below a fixed bound (in Chapter 5 we gave estimates for 
the number of such primes). Let S be the set of all primes in K whose 
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degree is less than or equal to (6gx — 6) / N — 3. Then every solution 
to XN + Y% = 1 in K is an S-unit. The corollary now follows from the 
theorem. 


Notice that the assumption that a solution be separable is essential since 
if (u, v) is a solution, then (u?” ,v?” ) is also a solution for all m > 1. 

We have just given a taste of the possible applications of the ABC The- 
orem in function fields. For more, see the paper by Silverman [2] and the 
book by R.C. Mason [1]. 

The restriction on the constant field in the corollary to Theorem 7.19 is 
not necessary. One could apply the classical theorem of de Franchis from 
algebraic geometry, which states, in part, that if K/F is a function field 
there are only finitely many subfields M such that F Cc M, K is separable 
over M, and the genus of M is greater than 1. In the notation of the 
corollary, if (u,v) is a non-constant separable solution to XN +Y" = 1, 
then F'(u, v) is a subfield, which satisfies these three properties (its genus is 
(N—1)(N—2)/2 > 1 since N > 3). Thus, we are reduced to worrying about 
how many solutions (u,v) and (u’,v’) can exist with F(u,v) = F(u’,v’). If 
this happens, there is an automorphism of F(u,v), which takes u to u’ and 
v to v’. A function field with genus greater than 2 has only finitely many 
automorphisms (see Iwasawa and Tamagawa [1]). It follows that there are 
only finitely many non-constant, separable solutions to XN +YN = 1 in 
K. 

The theorem of de Franchis is not easy to prove. The paper by E. Kani 
[1] contains a proof of an effective version of the theorem. The bibliography 
of that paper gives a number of relevant references to both the classical and 
more modern treatments. 


Exercises 


1. Let K = F(z, y) be a function field where x and y satisfy an equation 
of the form Y* = (X —a,)(X —az)---(X —a;). We assume the a; are 
distinct elements of F’. Let the divisor of x — a; in F(x) be denoted 
by P; — P,,. For each 7 show that tK/F(2)Ps = 258; where P; is a 
prime of K of degree 1. Use this information to compute the genus 
of K (don’t forget the role of the prime at infinity). 


2. With the same notation as in Exercise 1, suppose that n > 5. Show 
that each prime of K which is ramified over F(z) is a Weierstrass 
point (see Exercise 10 of Chapter 5). 


3. Let / be a prime not equal to the characteristic of F and K = F(z,y) a 
function field where x and y satisfy Y' = (X —a,)"!(X—ag)™ +.-(X— 
Am)". We assume that the a; are all distinct and that for each i, 
1 4n,;. Compute the genus of K. 


10. 


11. 
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. Assume that F contains a primitive N-th root of unity and that 


N is not divisible by the characteristic of Ff’. Consider a function 
field K = F(ax,y) where x and y satisfy an equation of the form 
XN +4Y%N =1. Compute the genus of K. 


. Let K be a function field of genus 0 and L/K a finite geometric 


extension. If L/K is unramified, show that LD = K. (Assuming the 
constant field is algebraically closed, this is the algebraic equivalent 
of the statement that the projective line is simply connected). 


. Let L/K be a finite, tamely ramified, geometric extension of the 


rational function field. Let P be a prime of K of degree 1. Suppose 
that L/K is unramified except possibly at primes lying above P. 
Show that L= kK. 


. Let L/K be a finite, separable, geometric extension of function fields. 


Set [L : K] = n. Suppose that degDz;~ > 4(n — 1). Show that 
gptl>n(gx +1). 


. With the same notation and assumptions as Exercise 7, suppose $ is 


a prime of D of degree 1 and that $ is totally ramified over K. Show 
that $B is a Weierstrass point. 


. Let L/K be a finite, separable, geometric extension of function fields 


with five or more totally ramified primes all of degree 1. Show that 
each of them is a Weierstrass point. (The results contained in Exer- 
cises 7, 8, and 9 are due to J. Lewittes.) 


Let S be a finite set of primes of the function field kK. Let a,b € K*. 
Show that the equation aX +bY = 1 has only finitely many solutions 
in S-units. 


Assume F is finite and let K/F be a function field, a,b € K*, and N > 
5 an integer not divisible by the characteristic of F’. Show that the 
equation aX" + bY = 1 has only finitely many separable solutions 
in K. If at least one of the two elements a and 0 is not a constant, 


there are only finitely many solutions altogether. (Hint: Pass to the 
extension field L = K( ¥/a, ‘/b).) 


8 


Constant Field Extensions 


In this chapter we investigate a very important class of extensions of func- 
tion fields, namely, constant field extensions. Let K/F be a function field 
with constant field F’. For every field extension EF of F' we want to define 
a function field KE over E and investigate its properties. We shall confine 
ourselves to the special case where E’/F is algebraic, which is substantially 
easier and which will suffice for most of the applications we have in mind. 
However, the general case is both interesting and important. Expositions 
of the general case can be found in Chevalley [1] and Deuring [1]. 

Let K be an algebraic closure of K and F Cc K the algebraic closure of F 
in K. If E is any field intermediate between F' and F’, we set KE equal to 
the compositum of K and F inside K. By definition, K is finitely generated 
and of transcendence degree 1 as a field extension of F’, and it is clear from 
this that KE is finitely generated and of transcendence degree 1 as a field 
extension of F.. Thus, K E is a function field over E. It is called the constant 
field extension of K by E. It is not true, in general, that F is the constant 
field of KE, but as we shall see shortly, this is often the case. The genus 
of K £& is always less than or equal to the genus of K. It can be shown by 
example that the genus can decrease. Once again, though, it is often the 
case that the genus remains unchanged under constant field extension. The 
magic hypothesis which tends to eliminate all “pathological” behavior is 
that F’ is a perfect field. We shall make that assumption throughout this 
chapter, except when explicitly stated to the contrary. For emphasis — 
unless otherwise stated we shall assume for the rest of this chapter that F 
is a perfect field. As a consequence E/F will always be separable algebraic 
and thus KE/K is also separable algebraic. 
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The last topic we will consider in this chapter is the theory of constant 
field extensions when the constant fields involved are finite. This will involve 
interesting questions. Among other things we will consider how primes, the 
zeta function, and the class number behave under constant field extension. 
In a later chapter, Chapter 11, we will consider the behavior of the class 
group and the class number of constant field extensions in greater detail. 


Proposition 8.1. Assume [E : F] < oo. Then, |KE: K|] =[E: F]. Any 
basis for E/F is also a basis for KE/K. 


Proof. Suppose first that E’/F is a finite, Galois extension. Then, by a stan- 
dard theorem in Galois theory, KE/K is also Galois and Gal(KE/K) = 
Gal(E/K 1 E). Since F is the constant field of K, EN K = F. It follows 
that Gal(K E/K) and Gal(E/F) have the same number of elements, which 
implies |KE: K] =[E: F]. 

Now suppose E//F is finite and separable. Let EF, be the smallest ex- 
tension of E in F which is Galois over F. Then [F, : F] = [KE,: K] = 
[KE, : KEKE: K] < [F, : EE: F] = [£, : fF]. The inequality in 
the middle comes about because, obviously, [KE, : KE] < [F,; : E] and 
[KE : kK] <[E: F]. We conclude that both inequalities are in fact equali- 
ties. This proves the first assertion. 

Suppose {Q1,Q2,:-- ,@,} is a basis for F/F’. It is easy to see that this set 
also generates K EF as a vector space over K. By the first part of the propo- 
sition, it follows that the set is also linearly independent since otherwise 


[KE:K]<n=[|E: Fl. 


We will need the following lemma in several of the following proofs. 


Lemma 8.2. (a) Suppose L/K is a finite extension of fields and that K 
contains a field F which is algebraically closed in kK. If B € L is algebraic 
over F’, then trr;K (G8) € F. (b) Suppose L/K is a finite extension of fields 
and that O C K is a subring of K which is integrally closed in K. Ifb € L 
is integral over O, then try ;«(b) € O. 


Proof. This is fairly standard so we merely sketch the proofs. 

For part (a) one considers the minimal polynomial for @ over K and 
shows that all its roots (in some extension field) are algebraic over F’. Thus 
the sum of the roots is algebraic over F and in K, so the sum of the roots 
is in F’. The trace is an integer multiple of the sum of the roots, so it is in 
F as well. 

Part (b) is similar. One shows that all the roots of the minimal polyno- 
mial for @ over K are integral over O. This implies that the sum of the 
roots is integral over O. The sum is also in K. Since O is integrally closed, 
the sum of the roots is in O. The trace is an integral multiple of the sum 
of the roots so it is also in O. 


Proposition 8.3. EF is the exact constant field of KE. 
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Proof. We have to show that any element of KE which is algebraic over 
FE is actually in E. 

Assume first that [E : F'] < oo, and that {a,,a2,--: ,a@n} is a basis for 
E/F. Suppose @ € KE is algebraic over EF. By Proposition 1, we may write 
B = S>j_, tia; where the x; € K. Multiply this relation by a; and take 
the trace of both sides. We find 


va) 
trxe/K(0;8) => treesK(ajoi) % 1<j<n. 


i=1 


Since 2 is algebraic over E and E is algebraic over F’, it follows that 6 
is algebraic over F’. By Lemma 8.2, part (a), try g/K(aj@) € F. Thus, by 
Cramer’s rule, we find each x; € F’. (We have used det(trx g/K(aia;)) = 
det(trz/r(aia;)) 4 0, which is true because E/F is separable). It follows 
that B = So, ria € EL. 

Now suppose that E’/F is algebraic but not necessarily a finite extension. 
Since @ € KE we must have G € KE, for some F C EF, C E with 
[E, : F] < oo. By enlarging E), if necessary, we can suppose ( is algebraic 
over /,. By the first part of the proof, @ € E,, which is contained in E. 
The proof is complete. 


Our next task is to show that constant field extensions of function fields 
are unramified extensions. This will be an easy consequence of the next 
lemma. 


Lemma 8.4. Let E'/F be a finite extension with {a1,Q2,--+ ,Qn} a basis 
for E over F. Let P be a prime of K and Op the corresponding valuation 
ring. Let Rp be the integral closure of Op in KE. Then {ay, a2,--- , Qn} 
is a free basis for Rp considered as an Op module. 


Proof. Since F C Op by definition, and each a; is algebraic over F’, it 
follows that each a, is integral over Op. 

Suppose b € Rp. By Proposition 8.1, we can write b = )>."_, zi0;, where 
each x; € K. Multiply this relation by a; and take the trace of both sides. 
One finds 


nr 
trxey/K(0jb) =) trensK(ajou) % I1Sj<n. 

i=1 
The left-hand side of these equations are in Op by Lemma 8.2, part (b). 
Again invoking Cramer’s rule and using the fact that the determinant of 
the coefficient matrix is a non-zero element of F' we conclude that each 7; is 
in Op. Thus, {a1,@2,--- ,@n} spans Rp over Op. It is linearly independent 
over Op (being a basis for KE over K) so it is a free basis for Rp over Op 
as asserted. 


Proposition 8.5. Suppose E/F is a finite extension. Then, KE/K is 
unramified at all primes. 
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Proof. Let P be a prime of K, Op its valuation ring, and Rp the integral 
closure of Op in KE. By Lemma 8.4, any field basis {a1,a2,+++ ,Q@n} for 
E/F is a free basis for Rp considered as an Op module. The discriminant 
ideal Op, /op is generated by det(try g/;K(a:a;)), which is a non-zero ele- 
ment of fF’. Thus, 0z,/0, = Rp. It follows by Proposition 7.9 of the last 
chapter that KE/K is unramified at every prime above P. Since P was 
arbitrary, the proof is complete. 


It is possible to talk about infinite algebraic extensions being unrami- 
fied. Once these definitions are given it can be shown that Proposition 8.5 
remains valid even without the restriction that E'/F be a finite extension. 

Now that we know KE/K is unramified, we want to find out how the 
degree and dimension of a divisor behaves in constant field extensions. For 
notational convenience, set L = KE. Let A be a divisor of kK. We want to 
compare deg, iz/;xK(A) with deg, A. This will turn out to be fairly easy. 
More difficult will be the comparison of I(iz;%(A)) with 1(A). We begin 


with two lemmas. 


Lemma 8.6. Let {@1,22,-:: ,2m} C K be linearly independent over F. 
Then, considered as a subset of KE, it remains linearly independent 
over E. 


Proof. Suppose 5°)”, 6;2; = 0 with each 3; € E. Assuming E/F is a finite 
extension, let {a1,2,:-+ ,n} be a basis for #/F’. Then, 8; = 77, cjiav; 
with c;; € F. Substituting and interchanging the order of summation yields 


j=1 \i=l 


Using Proposition 8.1, once again, we find }~"., c;;2; = 0 for each j with 
1 <j <n. Since the 2;’s are linearly independent over F by assumption, 
it follows that all the c;; are 0 which implies that all the 6; = 0. 

If E/F is not finite, suppose \“\", 6:2; = 0 with each 8; € E. Let 
£, be the field obtained from F’ by adjoining the elements of the set 
{ 81, 82,--+ , Bm}. Ey is a finite extension of F. Write L, = K E,. Working 
in this field, and using the first part of the proof, we conclude that all the 
GB; = 0. 


Lemma 8.7. Let L/K be a finite extension of function fields and P a 
prime of K. Suppose that {%B1,Be,---,Bg}, the primes above P in L, 
are all unramified over P. Let n € Z be a given integer. Finally, suppose 
ordg, (b) > —n for alli with 1 <i<g. Then ordp(trzx(b)) > —n. 


Proof. Let 7 € K be a uniformizing parameter at P. Then, since each %, 
is unramified over P we have 1 = ordp(m) = ordg, (7) for 1 <i <n. 
The inequalities ordy,(b) > —n are equivalent to ordy,(7”b) > 0. It 
follows that 7”b is in the intersection of the valuation rings Og, where 


8. Constant Field Extensions 105 


1 <12<_n. This intersection is precisely Rp, the integral closure of Op 
in L. Thus, 7b is integral over Op and by Lemma 8.2, part (b), we have 
trr/K(m"b) € Op. It follows that ordp(m"trz/«(b)) = 0 and this is equiv- 
alent to ordp(trz;%(b)) => —n as asserted. 


We are now in a position to answer the questions raised earlier. 


Proposition 8.8. Let E/F be a finite algebraic extension, K a function 
field with constant field F, and L= KE. Let A be a divisor of K. Then, 


(a) deg, in;K(A) = degy A. 


(b)  U(tzyK(A)) = UA) . 


Proof. By Proposition 8.3, E is the exact constant field of LD = KE. By 
Proposition 8.1, [L: K] = |E: F]. Part (a) now follows immediately from 
Proposition 7.7. 

To prove part (b), we recall that I(7z,;%(A)) is the dimension over E of the 
vector space L(iz;%(A)) = {vu € L | (v)_ + in/K(A) = 0}. By Proposition 
7.8, in/K(x)K = (x)z, and it follows immediately that L(A) C L(iz;%(A)). 
Let {x1,%2.--- , 2a} be a basis for L(A). This set is linearly independent 
over F’, so by Lemma 8.6, it is linearly independent over £. Consequently, 


(A) < Ui (A)) - 


The reverse inequality will follow if we can show that {21,22,--+ , vq} gen- 
erates L(iz/%(A)) over E, and this is what we will prove. 
Let z € L(iz;K(A)) and let, as usual by now, {a1,a2,--: , Qn} be a basis 


for & over F’. By Proposition 8.1, we can write z = )~\_, yo; where y; € K 
for 1 <i <n. Multiply both sides by a; and take traces to arrive at 


trpsK(a;2) = Strix (040%) Yi 1 <7 <n. 
i=1 


Suppose we can show that the trace of any element in L(iz/x(A)) is in 
L(A). Then the left-hand side of these equations are in L(A) and by 
Cramer’s rule, each y; € L(A). It follows that each y; is in the F-linear 
span of {@1,%2,--- , £q} and so z is in the E-linear span of {21, 22,-++ , rg}. 

It remains to prove that v € L(iz/%(A)) implies trz;~(v) € L(A). The 
main tool in doing this will be Lemma 8.7. We begin by recalling that 
v € L(t; (A)) if and only if for every prime $8 of L the following inequality 
holds: 


ordy(v) = —ordy(iz/K(A)) . 


Let P be the prime of K lying below $8. Since L/K is unramified by 
Proposition 8.5, we have ord (iz/%(A)) = ordp(A). The condition for v 
to belong to L(iz/%(A)) can be rephrased as follows. For all primes P of 
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K let {$1, Beo,--- , By} be the set of primes of L lying above P. Then, for 
each 7 with 1 <7 < g we have 


ordy, (v) > —ordp(A) . 


By Lemma 8.7, this implies that ordp(trz;«(v)) = —ordp(A) for all primes 
P of K. These are exactly the conditions for trz;~%(v) to belong to L(A), 
so the proof is complete. 


Proposition 8.8 provides us with all the background we need to determine 
how the genus behaves in constant field extensions. 


Proposition 8.9. Let E/F be a finite extension and L = KE. Then the 
genus of L, considered as a function field over E, is equal to the genus of 
Kk. (Once more we emphasize that, by hypothesis, F' is perfect). 


Proof. Let g be the genus of K and g’ the genus of L. Choose a divisor 
A of K such that deg, (A) > max(2g — 1, 2g’ — 1), e.g., A=nP, where P 
is a prime divisor and n is a sufficiently large positive integer. By Propo- 
sition 8.8, part (a), we have deg; (iz/%(A)) = degx (A). By Corollary 4 to 
Theorem 5.4 we have 


(A) = degx(A)—g+1 and Uizyx(A)) = degz(izyx(A)) — 9! +1. 


By Proposition 8.8, part (b), we have I(iz/%(A)) = I(A). It follows that 
—-g+1=-g’+landsog=q’. 


In the last proposition we could have assumed that E’/F is algebraic, but 
not necessarily finite. The conclusion makes sense and is correct. To prove 
this in complete generality necessitates a discussion of extension of divisors 
in infinite, algebraic constant field extensions. We will sketch how this goes 
after we investigate the way in which primes split in finite constant field 
extensions. 


Proposition 8.10. Let E/F be a finite extension and L= KE. Let 3 be 
a prime of L and P the prime lying below it in K. Define Ex = On /B 
and F'ip = Op/P. Then, Es is the compositum of Fp and E. 


Proof. Let @ € Eg and let w be an element of Og representing @. Let’s 
consider {8; = , Po,--- ,B,} , the primes in L lying over P. By the 
weak approximation theorem, we may find an element w’ € ZL such that 
w’ =w (mod $) and w’ = 0 (mod ¥;) for 2 <i < g. Then w’ € Rp, the 
integral closure of Op in L. By Lemma 8.4, any basis {a 1, @2,--- ,Qn} of 
E;/F is automatically a free basis of Rp considered as a module over Op. 
Thus, 


nm 
Ww! = EZ a; with 2; € Op. 


t=1 
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Now reduce both sides modulo $8 and we see that @ is in the compositum 


of Fp and E. 


By a small variation of this proof, we can give a very explicit way of 
understanding how primes split in a constant field extension. 


Proposition 8.11. With the notation of the previous proposition, suppose 
Fp = F{6] and that h(T) € FIT] is the irreducible polynomial for @ over 
F’. Let 

A(T’) = hi (D)ha(L) «+ hg (TL) 
be the prime decomposition of h(T) in E/T]. There are exactly g primes 
{$1,Be,---, By} of L lying above P. The numbering can be chosen in 
such a way that for1 <i<g we have deg, B; = degh,(T'). Moreover, 


g 
degy P=) deg, Pi . 
i=l 
Proof. Lemma 8.4 can be restated to say that Rp = Op @p LE. Reducing 
both sides modulo P yields, Rp/PRp = Fp ®F E. By hypothesis, Fp = 
Fié| = F{[T]/(h(T)). Thus, 


Rp/PRp & Fp ®p E & E{T]/(h(T)) = Qe 


The right-hand side is a direct sum of fields. Let M; be the maximal ideal 
which is the kernel of projection on the i-th factor. Let p; be the maximal 
ideal of Rp which goes to M; under Rp — Rp/PRp followed by the above 
sequence of isomorphisms. Set 8; equal to the maximal ideal of the ring 
“Rp localized at p,,” i.e., the ring Og,. A simple check shows that the set 
of primes of L given by {81, Po,--- ,%,} has all the properties asserted, 
except perhaps the last one about the sum of the degrees. To prove this, 
simply notice that )7?7_, deg, Bi = D-7_, deg hi(T) = deg h(T) = deg, P. 


Corollary. Suppose Fp = F[T]/(h(T)) and that E is an extension of F of 
degree n in which h(T) decomposes as a product of linear factors. Then in 
KE the prime P splits into n primes of degree 1 


Proof. Clear. 


The proof of the above proposition was most easily accomplished by 
choosing a primitive element (which exists since F’p/F is separable) for the 
field extension F'ip/F. However, the situation can be described in a more 
canonical way without having to make any choices. Consider the algebra 
F’p @F E over F’. The proof shows this algebra is a direct sum of fields, Ly, 
say, each of which is a field extension of E. 


g 


wl 
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Then there is a one-to-one correspondence between primes %3;, lying above 
P in KE and the fields L; with the property that the residue class field of 
3B; is isomorphic to the field L;. 

We have been assuming that the constant field extension E/F is finite, 
but this is not necessary. Let E'/F be an algebraic, but possibly infinite, 
extension of fields. Using properties of tensor product and Lemma 8.4, 
one can prove that the integral closure Rp of Op in KE is isomorphic to 
Op @r E, the map being w @ a > aw € Rp. The statement and proof of 
Proposition 8.11 can now be repeated without change. 

As a special case, let F = F’, an algebraic closure of F. Every polynomial 
in F'[T] splits into a product of linear factors in F'[T’]. Consequently, every 
prime P of K splits into deg, P primes of degree 1 in KF. This will be 
very useful later when we discuss how to use results about the geometry of 
algebraic curves to give us information about the arithmetic of algebraic 
function fields. 

As an illustration of the material developed in this chapter we will now 
discuss the particular case when the constant field F' = F, a finite field 
with q elements. Let F be an algebraic closure of F and F, the unique 
intermediate extension such that |[F, : F] =n. Set K, = KF py. 

We recall some definitions from Chapter 5. We set a,(K) equal to the 
number of primes of degree m, b,,(K) equal to the number of effective 
divisors of degree m, and h(K) equal to the number of divisor classes of 
degree zero, i.e., the class number of K. The latter number was denoted hx 
in Chapter 5. These numbers are all finite. We would like to compare them 
with the numbers a@m(Kn), bm(K,n), and h(K,). We also want to compare 
the zeta function of K, with that of K. Of course, all these questions are 
interrelated. There are connections between this material and Iwasawa’s 
theory of cyclotomic number fields. We will discuss these connections in 
more detail in Chapter 11. 

The first thing to do is to make precise the way in which primes of K 
split in K,,. Let P be a prime of K and $ a prime lying above it in K,. By 
Proposition 8.10, the residue class field of 8, is the compositum of Op/P 
and F,, inside Og /$8. To compute the compositum and its dimension over 
FF we can invoke the following simple lemma. 


Lemma 8.12. The compositum of F, and Fm is Fin, my where [n,m] is the 
least common multiple of n and m. 


Proof. Let F;,, be the compositum of F,, and F,, inside F. Since Fy, Fin C 
F,, we have n,m | h, which implies [n,m] | h. Thus, Finjm) G Fr. On the 
other hand, since n,m | [n, m], we have F,, Fm C Fin,m), and so Fr, © Fin my. 
Proposition 8.13. Let P be a prime of K. Then P splits into (n, deg x P) 
primes in K,. Let B be a prime of Ky, lying over P. Then 


deg x P n 


M8KnB Gdege P) MO LORIE) = Go gege Py 


8. Constant Field Extensions 109 


Proof. By definition, the dimension of Op/P over F is degy P. Thus, 
by the above lemma, the compositum of Op/P and F,, inside Om / has 
dimension [n, deg, P] over F. By Proposition 8.10, this compositum is equal 


to Os /SB, and so 


, _[n,degx P|] deg P_ 
degie, B= [Op/P : By] = PE SBKAL SE 

The last equality follows from elementary number theory — for any two 
non-zero integers n and m, nm = (n,m)|n, mJ. 

The relative degree f(%8/P) is the dimension of On /%8 over Op/P, which 
in this case is [n, deg, P|/degy P =n/(n, deg, P). 

Finally, we recall that K,,/K is unramified. Since each prime 8 over P 
has relative degree n/(n, deg, P) and [|K, : K] =n we see the number of 
primes above P is (n,degy P) by Theorem 7.6. 


Corollary. A prime P of K splits into deg, P primes of degree 1 in Ky, 
if and only if deg; P divides n. 


Proof. This is immediate from the proposition. 


Proposition 8.13 is the key to comparing the zeta function of K with 
that of K,. The only other piece of information needed is provided by the 
following elementary lemma. 


Lemma 8.14. Let ¢, € C be a primitive n-th root of unity and m a positive 
integer. Then 


n—1l 


I[a _ CM yl) _ (1 _ ylrm]) (n,m) . 

i=0 
Proof. First consider the case where m = 1. The result in this case follows 
from the identity T” —1 = [];_,({ — ¢¢) by making the substitution 
T =u! and simplifying. 

In the general case, let m’ = m/(n,m) and n’ = n/(n,m). It is easy to 
see that ¢”” is a primitive n’ root of unity. Call it ¢,,. Every 7 in the range 
0 <%< ncan be uniquely represented in the form i = kn’ +r, where 
0<k< (n,m) andO<r<_n’. Thus, 


(nym)—-1 n’'—1 


[]a-crw) = [TL a= ciu) =a uy 
i=1 h=0 =0 


Finally, mn’ = mn/(n,m) = [n,m]. 


Theorem 8.15. Let u=q~* and CxK(s) = ZK (u). Then, 


n—-l 
Cx, (8) = ZK,(u”) = Il ZK (GU) . 
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Proof. Since the constant field of Ky, is F, which has gq” elements, we 

have €x,,(s) = Zx,,(u’), where u’ = (q”)~* = q-™ =u". Thus, ¢x,(s) = 

ZxK,,(u™), which proves the first equality in the statement of the theorem. 
Setting dp = (n, deg, P) for each P € Sx, we have 


Cx, (s) = [][a— Np) =][a-NP Ce )e , 
B P 
where $B ranges over Sx, and P ranges over Sx. We have used Proposition 


8.13 and the fact that NB = NPSP/P), Since NP = q%8x PF, by definition, 
the last product can be rewritten as 


n—-1 
ate _ unrdegx P/dp)—dp _ Il [[a _ cides x P, dee x Py-1 


P P i=0 


Here we have used Lemma 8.14 with m = deg, P. Recall that Zx(u) = 
[[p(1 — ute8« ”)-1 Now, interchanging the order of the products on the 
right-hand side of the above identity completes the proof. 


By the proof of Theorem 5.9, Zx(u) = Lxe(u)/(1 — u)(1 — qu) where 
Lx(u) is a polynomial of degree equal to twice the genus g of K. By 
Proposition 8.9, A, has the same genus as K. Let u’ = wu”, as above, and 


we have Lr. (w) 
Z \ _ KA) 
Kn (u ) (1 _ u')(1 _ q™u’) 


Proposition 8.16. Let LDx(u) = T2..(1 — ju) be the factorization of 
Lx(u) in Clu]. Then, 


Lx, (u') = [[a — Tu’) . 


j=l 


Proof. Using the definitions and Theorem 8.15, we find 


Dx, (u” — Ctu) 
ears “Tl; eat re u) 


For any complex number 7 we have the identity [[j__, 5 (1- —Ciru) = 1-7" u". 
Thus, 
Lx,,(u") 7 T1211 — wPu”) 
(L—u™)(1—qgru")  (1—um)(1— gnu) © 
The proposition follows upon canceling the denominators. 


Corollary. h(K,) = [[f2,(1 — 7). 
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Proof. Applying Theorem 5.9 with K replaced by K, we find Lx, (1) = 
h(K,). The result follows upon substituting u’ = 1 in the proposition. 


By the Riemann Hypothesis, |7;| = ,/q for 1 < 7 < 2g. Using this and 
the corollary we get the following lower bound for h(K,,): 


h(Kn) > (qr/? — 1)°9 . 


This shows that h(K,,) grows rapidly with n. A more precise investigation 
of how h(K,,) varies with n is possible. The results one obtains lead the way, 
by analogy, to Iwasawa theory in algebraic number fields (see Iwasawa [3]). 
As stated earlier, we will discuss these matters in greater detail in Chap- 
ter 11. 

The next proposition gives some insight into how the numbers },,(K,,) 
grow with n. 


nm—g+tl__ 


Proposition 8.17. b,(K,) = h(Kn)4 , provided m > 2g — 2. 


Proof. We proved earlier (see the remarks following Lemma 5.8) that if 
m > 2g — 2, where g is the genus of K, that 
g™—9gtt —] 
bm (1) = h( ke) —————— 
(K) = WK) 


In this equality, replace K by K, and q by q” (since the constant field of 
kK, has q” elements). The resulting equation is valid because the genus of 
K,, is the same as the genus of K (Proposition 8.9). 


In Chapter 5 we provided three different description of Z,%(u), namely, 


Z(u) = S— bm(K)u™ = [[ (1 — ut) = exp (> Nm) v) | 


™m 


By definition, Nin(K) = dv ajm 4aa(K) and we showed in Chapter 5 that 
Nm(K) = q™+1- rad _7;". Although these numbers are clearly very 
important we did not give an interpretation of them. We can do so now. 


Proposition 8.18. N,,(K) is equal to the number of prime divisors of 
Ky», of degree 1. 


Proof. It is interesting to note that Ni(K) = ai(K), so the result is 
certainly true when m = 1. 
To prove the general case, we invoke Proposition 8.16. 


HjaiG = are’) 
(1 = w/)(1 — gu’) 
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Compute Z (0)/Zxm(0) in two different ways using these expressions. 
We find 


29 
gq? +1—S on™ =bi(Kn) - 
j=l 


The left-hand side is just N,,(K) and the right-hand side is the number 
of effective divisors of degree 1 in K,,, which is the same as the number of 
prime divisors of K,, of degree 1. 


We have proved Ni»(K) = b1(Km) = a1(Km) = Ni(Km). Assuming 
some algebraic geometry we can reword the result as follows. Let X be an 
absolutely irreducible, non-singular curve defined over a finite field F with 
q elements. For each m > 1 let N/,(X) be the number of rational points on 
X over F,,, i.e., Ni (X) = #X(F,). It can be shown that N/,(X) is equal 
to the number of prime divisors of degree 1 belonging to the function field 
Fim(X) of X over F,,. This means that N/ (X) = Np(K) by the above 
proposition, where K = F(X). Thus, the zeta function of the function field 
of X, K = F(X), is equal to exp(-~_, Ni, (X)/m u™). This approach 
enables one, in a fairly obvious manner, to define the zeta function of 
a variety X of any dimension over a finite field by using the numbers 
#X(F,). A beautiful exposition of the general theory is given in Serre [1]. 


Consider the identity N,,(K) = Ni(K7,,). Let’s apply this to the field 
K,, rather than K. It is easy to see that (Kn)m = Knm and it follows that 


Nin(Kn) = Ni(Knm) = Nn(Km) - 


This identity allows us to derive an interesting expression for the number 
of primes of degree m in the field Kn, i.e., @m(Kn). 


Proposition 8.19. am(Kn) = M7" Yam H(d)a1(Knm/d) - 


Proof. From the definition, N,,(K,) = dim daqg(K,,). Using Mobius in- 
version, we see that Mam(Kn) = diam H(@)Nmya(Kn). From the relation 
Nm/d( An) = Ni(Knmsa) = 1(Knma) the result follows. 


There is much more to be said about the fascinating sequences of numbers 
we have introduced, but it is time to break off this development for now 
and to pass on to other matters. 


Exercises 


1. Let K = F(zx,y) be the function field associated to the curve Y* = 
f(X), where f(X) is a square-free polynomial of degree n. Assume 
that char(f’) #4 2. Compute the genus. (Hint: Reduce to the case 
where f(X) is a product of linear factors and apply Exercise 1 of 
Chapter 7). 
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. Generalize Exercise 1 to the case where / is a prime, f(X) is l-th 
power-free, | { char(F’), and the curve is Y! = f(X). 


. Consider the curve X% + Y™% = 1 and the associated function field 
F(x, y). Assume char(F’) { N. Compute the genus of K. 


. Let Fo be a field of characteristic p > 0 and set F' = Fo(T), the ra- 
tional function field over fo. Consider the function field K = F(z, y) 
over F', where x and y satisfy the equation Y? = X? —T. Prove that 
the genus of K is (p — 1)/2 (use the Riemann-Hurwitz theorem ar- 
guing via the extension K/F(x)). Now, extend the constant field to 
F’ = F(#/T). Show that K’ = F’(x,y) has genus 0. This does not 
contradict Proposition 8.9 since the extension F’/F is purely insep- 
arable. 


. Let E/F be a finite Galois extension with group G. Identify G with 
the Galois group of KE/K. Let B be a divisor of KE which is in- 
variant under G, i.e., 06 = B for all o € G. Show that L(B) has a 
basis consisting of elements of K. (Hint: Use Propostion 9.2 of the 
next Chapter to show that B = ixg/%B for some divisor B of K. 
Then invoke the proof of Proposition 8.8). 


. Let K/F be a function field over a finite field and let Lx(u) = 
Ik? ,(1 — mu) be the numerator of the zeta function of K. Assume 
that there is a positive constant C’ such that for all r > 1 we have 
|N-() —q" —1| < Cq?. Prove that |7;| = ,/@ for all i. (Hint: Expand 
Li. (u)/LK(u) in a power series about u = 0 and consider the radius 
of convergence). 


. Let K’/F be a function field of genus 1 over a finite field. Show that 
N,(&) determines all the other numbers N,(K). 


. Generalize the last exercise as follows. Let K/F be a function field 
of genus g > 1 over a finite field. Show that the numbers N(K), 
No(K),...,.Ng(K) determine all the other numbers N,(K). 


9 


Galois Extensions - 
Hecke and Artin L-Series 


In Chapters 7 and 8 we discussed finite extensions L/K of algebraic func- 
tion fields. We propose to continue that discussion here under the special 
assumption that the extension L/K is Galois. To simplify the discussion 
we continue to assume that the constant field F' of K is perfect. 

After proving a number of basic results in the general case, i.e., F’ being 
perfect but otherwise arbitrary, we specialize to the case where F = F, 
a finite field with q elements. Then, for every prime $8 of L unramified 
over K we associate an automorphism (%, L/K) in G = Gal(L/K) called 
the Frobenius automorphism of 8. This is one of the most fundamental 
notions in the number theory of local and global fields. It will be seen 
that if P is a prime of K, unramified in L, the set of automorphisms 
(P,L/K) =: {(%,L/K) | 8 above P} fill out a conjugacy class in G. 
Suppose C’ C G is a conjugacy class. One can ask how big is the set of 
primes P € Sx such that (P,L/K) = C? The answer to this question is 
given by the ‘I’chebotarev density theorem. We will discuss two forms of this 
important result, one involving Dirichlet density and the other involving 
natural density. The key tool will be Artin L-series and their properties. 

Let x be a complex character of the group G = Gal(L/K). E. Artin 
showed how to associate an L-function, L(s,), with such a character (see 
Artin [2]). It is defined and analytic in the half plane {s € C | R(s) > 1}. 
Artin was able to show that L(s,x) can be analytically continued to a 
neighborhood of s = 1 and that if y is irreducible and y # xo, the trivial 
character, then L(1,x) 4 0. It is this property which will enable us to 
prove the version of the Tchebotarev density theorem formulated in terms 
of Dirichlet density. 
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Artin conjectured that his Z-series can be meromorphically continued to 
the entire plane and that if y is non-trivial and irreducible then L(s, x) can, 
in fact, be continued to an entire function on the whole plane. R. Brauer 
was able to prove the first part of this by means of a deep theorem about 
group characters. His proof works in both number fields and function fields. 
The second part of Artin’s conjecture is still an open question in number 
fields. It is one of the most important open questions in that area. In the 
function field case, the matter was resolved by A. Weil [1] in the same 
small book in which he first proved the Riemann hypothesis for curves over 
a finite field. He showed, subject to a small technical restriction (which 
we will discuss), that if x is non-trivial and irreducible, then L(s, x) is a 
polynomial in q~*. On the basis of this result we will give a proof of the 
version of the Tchebotarev density theorem formulated in terms of natural 
density. 

The reader will not fail to notice that the above discussion has the same 
flavor as the material in Chapter 4 where we discussed Dirichlet L-series 
and the Dirichlet theorem about primes in an arithmetic progression. How- 
ever, in that chapter there was no discussion of Galois groups and charac- 
ters on them. We considered the groups (A/mA)* and to a character on 
such a group we associated an L-series. Is there any connection between 
the two types of L-series? The answer is yes, but the explanation is ex- 
tremely subtle and difficult. It is by trying to answer this question in the 
most general context that Artin was led to the famous Artin reciprocity 
law, perhaps the deepest and most far-reaching theorem in all of algebraic 
number theory. We will attempt a general discussion of these matters, but 
mainly without proofs. We will investigate what happens when Gal(L/K) 
is an abelian group. This will lead to a rough statement of Artin’s reci- 
procity law. When G(L/K) is abelian and L/K is unramified we will give 
a proof of Weil’s result using Artin’s reciprocity law. In general, we will de- 
fine Hecke L-functions for characters of finite order (Dirichlet L-series are 
a special case of these) and state some of their properties without proof. 
Artin reciprocity allows one to show that for one-dimensional characters 
Artin L-series L(s,y) “are” Hecke L-series and, in the abelian case, Artin’s 
conjecture about his L-series being entire will follow from this. 

It is time to begin! 


We assume L/K is a finite, Galois extension of function fields and denote 
the Galois group by G = Gal(L/K). As usual, let F' be the constant field 
of K and £& the constant field of L. 


Proposition 9.1. The field extension E:/F is Galois and the map G > 
Gal(E/F) obtained by restriction of automorphisms to E is onto. Let N C 
G be the kernel of this map. Then the fixed field of N is KE, the maximal 


constant field extension of K contained in L. 
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Proof. If ao € G and a € E, then the fact that a is a root of a polynomial 
with coefficients in F’ shows that oa must be a root of the same polynomial 
since o fixes F’. Thus, ca is algebraic over F, which implies ca € E. This 
shows that the restriction map takes G to the group of automorphisms of 
F, which leave F fixed, namely, Aut(E/F). Let G’ be the image of this map. 
Then, the fixed field of G’ is EM K = F. This proves E'/F is Galois and 
that G’ is its Galois group. 

It is clear that N leaves K E fixed, so to prove KE is the fixed field of N 
it suffices to show |N| = |L : KE]. Since G/N & G’ we see |G| = |N||G"| 
which by Galois theory is the same as [L: K] = |N|[E: F']. By Proposition 
8.1, [fh : F| =|[KE: K] and it follows that |N| = [L: KE] as required. 


Let $ be a prime of L lying over a prime P of K. Recall that $B is the 
maximal ideal of a discrete valuation ring Os which contains the constant 
field & and whose quotient field is L. Let 0 € G. Then oO is a dvr with 
the same properties and its maximal ideal is o$8. Thus, o% is another prime 
of L and it is easy to verify that it also lies above P. The group G acts as 
a group of permutations on the set of primes above P. 


Proposition 9.2. Let {%1,PBo,--- ,B,} be the set of primes of L lying 
above P. The Galois group G acts transitively on this set. 


Proof. For each i with 1 < i < g we need to show there is ao € G such 
that oP, = By. 

Consider the set {o, | o € G}. Suppose some ; is not in this set, B, 
say. We will derive a contradiction. 

By the weak approximation theorem we can find an element x € L such 
that c = 0 (mod $B,) and x =1 (mod $B;) for 1 ¥ g. Since these conditions 
imply x € Os, for all 1 <i < g, we have x € Rp the integral closure of Op 
in L. It follows that ox € Rp for allo € G and [[, .gax€ RpNK = Op. 
Since x € Py Rp, we have, [],-<gorx € P,NRpPNOp =P C Fy. Since 
93, is a prime ideal, there is a 7 € G such that rx € $B, and so x € 77! Pj, 
which contradicts x = 1 (mod 7~!9,). 


Proposition 9.3. We continue to use the notation introduced above, except 
that we now denote the number of primes in L lying above P by g(P). We 
have f(Bi/P) = f(B;/P) and e(P;i/P) = e(B;/P) for all 1 < i,j < g. 
If we denote by f(P) the common relative degree and by e(P) the common 
ramification index, then e(P)f(P)g(P) =n =[L: K]. In particular, e(P), 
f(P), and g(P) divide n. 


Proof. For a given pair i and j there is an automorphism o € G such that 
op; = Bj. Map Og, /P;i 9 Op, /PB; by G — GU, It is straightforward to 
check that this map is well defined and gives a field isomorphism which 
leaves Op/P fixed. It follows immediately that f(%;/P) = f(;/P) as 
asserted. 
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Similarly, if POp, = Bf, applying o to both sides yields POg, = P¥. 
Thus, e(B;/P) = e(B,/P). 

By Theorem 7.6, a) e(%;/P)f(%i/P) =n, so the last two assertions 
follows from this and the first part of the proof. 


Let $% be a prime of L lying over a prime P of K. We now define two 
important subgroups of G = Gal(L/K): 


Z(P/P) = {0 €G| oP =P} and 
(p/P) = {reG|tw=w (mod $), Vw € Og} 


The first is called the decomposition group of $8 over P and the second is 
called the inertia group of $8 over P. 


Lemma 9.4. The order of Z2(%3/P) is e(B/P)f(®/P). 


Proof. By Proposition 9.2, the group G acts transitively on the set of 
primes of L lying above P. The group Z(8/P) is the isotropy group for 
this action. From this it follows that [G : Z($8/P)] = g(P), the number of 
primes in L above P. By Proposition 9.3, we have e(B/P)f($%/P)g(P) = 
[L: K] = #G. Thus, #2(%/P) = e(P)f(P). 


Let M C L be the fixed field of Z7($8/P) and p the prime in M lying 
below $8. M is sometimes called the decomposition field of $B. 


Lemma 9.5. With the above notation, 8 is the only prime in L lying above 
p. Moreover, e(p/P) = f(p/P) =1 and |M : K] = g(P). 


Proof. The first assertion follows by applying Proposition 9.2 to the Ga- 
lois extension L/M and using the definition of the decomposition group. 
By Lemma 9.4, #Z(33/p) = e(/p) f(%3/p). On the other hand, Z7(98/p) = 
Z($8/P) and the order of this group is e(/P)f(%/P). The fact that 
e(p/P) = f(p/P) = 1 follows from this since e(/P) = e(/p)e(p/P) 
and f(B/P) = f(/p)f(p/P). Finally, the index relation [LD : K] = 
[L : M\[M : K], together with [L : K] = #G and [L : M] = #Z(/P), 
implies the last assertion that [M : K] = g(P). 


Let’s reintroduce the notation Fis for the residue class field of Osx and 
Fp for the residue class field of Op. 


Theorem 9.6. Suppose L/K is a Galois extension with G = Gal(L/K) 
and that 8 is a prime of L lying over a prime P of K. Then the exten- 
sion E/Fp is also a Galois extension. There is a natural homomorphism 
from Z(§B/P) onto Gal(Eg/Fp) and the kernel of this homomorphism is 
I(B/P). The inertia group is a normal subgroup of the decomposition group 


and #1 (8 /P) = e(B/P). 
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Proof. Since F’p is perfect, there is an element 9 € Og such that Eg = 
Fp(@) where @ is the residue class of 6 modulo %. By using the weak 
approximation theorem, if necessary, we can assume that @ is integral over 
Op. As above, let M be the fixed field of Z($8/P) and f(X) € M[X| 
the minimal polynomial for 0 over M. Since @ is an integral element, the 
coefficients of f(X) are in Op = Omg MM. Since L/M is a Galois extension, 
f(X) splits into linear factors in L, ie., f(X) = []j2,(X —4:) where 6 = 44. 
Reducing modulo 8 , we have f(X) = [[/",(X — 6;). The coefficients of 
f(X) are in the residue class field of Op, which is the same as Fp since 
f(p/P) = 1 by Lemma 9.5. This shows that Eg = Fp(0) is the splitting 
field of f(X) and so Ey is Galois over Fp as asserted. 

Ifo € Z(B/P) and & € Eg, define & by the equation o(@) = ow. It 
is easy to check that @ is a well-defined mapping from Ex to itself which 
is, in fact, an automorphism leaving Fp fixed, i.e., ¢ € Gal(Ly/Fp). The 
map o to @ is a homomorphism and the kernel of this homomorphism is 
I(/P). Again, all this is straightforward from the definition. It remains 
to show that the homomorphism which takes o to @ is onto Gal(Fy/Fp). 

Let A € Gal(Ey/Fp). Let h(X) € Fp[X] be the irreducible polynomial 
of @ over Fp. Then )@ is also a root of h(X). Since 6 is also a root of f(X) 
(see the first paragraph), h(X)|f(X). It follows that A@ = 0; for some root 
6; of f(X). Since f(X) is irreducible over M, there is ao € Z(/P) such 
that of = 0;. Thus, ¢9 = 6. From this, and the fact that 6 generates Esp 
over Fp we can conclude that \ = G. This proves the onto-ness. We have 
shown that the following sequence is exact. 


(e) + I(8/P) + Z($%/P) -> Gal(Ey/Fp) > (e) - 


The middle term has order e(8/P)f(98/P) and the end term has order 
f($B/P). One concludes that #1($%/P) = e(PB/P). 


Corollary. If B/P is unramified, then Z(B/P) & Gal(Eg/Fp). 


We continue with two propositions about how the decomposition groups 
and inertia groups behave “functorially.” 


Proposition 9.7. Suppose L/K is a Galois extension of function fields and 
suppose "8 is a prime of L lying above a prime P of K. Leto € Gal(L/K). 
Then, Z(of/P) = oZ(%/P)o—! and I(oP/P) = ol (P/P)o~!. In par- 
ticular, all the decomposition groups of primes above P in L are conjugate 
and similarly for the inertia groups. 


Proof. By definition, 7 € Z(o%/P) if and only if ro¥% = of. This is so if 
and only if o~'ro8 = , i.e., if and only if o~!ra0 € Z($8/P), which holds 
if and only if rT € ¢Z($8/P)o—'. This proves the first assertion. The proof 
of the second is entirely similar. 

To prove the last assertion, it is enough to recall that by Proposition 9.2, 
all the primes above P are of the form o for o € Gal(L/K). 
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Proposition 9.8. Let L/K be a Galois extension of function fields and 
M an arbitrary intermediate field. Let 8 be a prime of L and p and P the 
primes of M and K respectively which lie below %. Set H = Gal(L/M). 
Then, 


(i) Z2(P/p)=HOZ(P/P) and ICP/p) = AOIB/P) . 


Now, assume H is a normal subgroup and that p is the restriction map 


from Gal(L/K) > Gal(M/K). Then, 
(it) p(Z(B/P))=Z(p/P) and pU(P/P)) =1(p/P) . 


Proof. Part (i) of the proposition follows directly from the definitions. 

To prove part (ii) we first remark that from the definitions it is easy 
to prove that p maps Z(8/P) to Z(p/p). The kernel of the this map is 
Z($/P)A HA = Z2(/p). Thus, the order of the image is 


e(B/P) f(B/P)/e(B/p) f(B/p) = e(p/P) fF (p/P) = #4 (p/P). 


This proves the map is onto. 
The proof for the inertia groups is entirely similar. 


Let L/K be a finite extension of function fields, and P a prime of K. We 
say that P splits completely in L if there are [L : K] primes above it in L. 
From the relation }>7_, e:f; =n it follows that if a prime splits completely 
in L, every prime above it is unramified and of relative degree 1. Suppose 
L/K is a Galois extension and that $B is some prime of L above P. Then, 
by Proposition 9.3 and Lemma 9.4, we see that P splits completely in L if 
and only if Z(8/P) = (e). More directly, the Galois group acts transitively 
on the primes above P and the decomposition group of one of them is an 
isotropy group for this action. Thus, one gets [LD : K] primes above P if 
and only if this decomposition group is trivial. 

We recall that a prime P of K is said to be unramified in D if and only 
if every prime above it in L is unramified. 


Proposition 9.9. Let M, and Mz be two Galois extensions of a function 
field K and let L = M,Mp be the compositum. A prime P of K splits 
completely in L if and only if it splits completely in M, and Mg. A prime 
P of K is unramified in L if and only if it is unramified in M, and Mo. 


Proof. Let $8 be some prime of L lying above P. If P splits completely 
in L, then by the previous remarks Z($8/P) = (e). Let p; and pe be the 
primes of M; and Mo, respectively, which lie below $8. By Proposition 9.8, 
part (iz), we deduce that Z(p;/P) = (e) and Z(po/P) = (e). Thus, P splits 
completely in M, and Mo. 

Now suppose that P splits completely in M, and M2. Then, Z(p;/P) = 
(e) and Z(p2/P) = (e). Let o € Z($%/P). By Proposition 9.8, part (iz), we 
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see the restrictions of o to both M,; and Mp» are the identity maps. Since My 
and M2 generate L, it follows that o is the identity. Thus, Z7(B/P) = (e) 
and so P splits completely in L. 

Once again, the proof of the last assertion about unramifiedness is en- 
tirely similar. We omit the details. 


We conclude this part of the chapter by sketching the behavior of a 
prime P in the fixed fields of 7(8/P) and 1($8/P), where % is a prime of 
L lying above P. To ease the notation, call the two subgroups Z and J and 
the corresponding subfields Dz and L; of L (we previously denoted Lz by 
M). We have K C Lz C L; C L. The fields Lz and Ly are called the 
decomposition field and the inertia field of $8. Let pz and py; be the primes 
of Lz and Ly, respectively, which lie below %. Then f(pz/P) = 1 and 
e(pz/P) =1. If Gal(Z/K) is abelian, it follows that P splits completely in 
Lz. It is the case that p; is the only prime of L; above pz and we have 
e(pr/pz) = 1 and f(pr/pz) = f(®/P) = [L7 : Lz]. Finally, P is the only 
prime of L above p; and we have f(%/p7;) = 1 and e(B/pr) = e(B/P) = 
[L : L7|. We say that pz/P is unramified of degree 1, that py/pz is inert, 
and §8/pr; is totally ramified. All this is relatively easy to prove on the basis 
of our earlier results. We leave the details as an exercise. 


This is about as far as we wish to go with the general theory. Although 
we have been working in function fields, it is clear that most of what we 
have proven will work in a more general context of Dedekind domains, their 
quotient fields, and finite extensions thereof. 

For the rest of this chapter we will be working with global function fields, 
i.e., function fields whose field of constants is finite. A key notion in this 
context is that of the Frobenius automorphism attached to an unramified 
prime ideal. Our first goal will be to define this object and discuss its 
properties. 

Let K be a function field whose constant field F is a finite field with q 
elements. Let L/K be a finite, Galois extension with constant field E. Let 
G, as usual, denote Gal(L/K). Suppose P is a prime of K and $B, a prime 
of L lying above P. The residue class fields are finite and, as is well known, 
the Galois group, Gal(Eq/Fp), is cyclic, generated by ¢p which is defined 
by ¢p(x) = 2” for all x € Eg (the point is that NP = |Fp}). 

If we suppose that $3/P is unramified, then by the corollary to Proposi- 
tion 9.6 we have a canonical isomorphism Z7(3B/P) =~ Gal(Eg/Fp). Under 
these circumstances, there is a unique element (¥,L/K) € Z(98/P) which 
corresponds to @p under this isomorphism. (8, L/K) is called the Frobe- 
nius automorphism of 8 for the extension L/K. Going back through the 
definitions we see that the Frobenius automorphism can be characterized 
by the following condition 


(B,L/K)w=w? (mod P) Vw Ee Og. 
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Proposition 9.10. Let L/K be a Galois extension of global function fields, 
a prime of L and P the prime of K lying below it. Suppose B/P is un- 
ramified. Then, (8, L/K) is a cyclic generator of Z(%3/P) and consequently 
has order f($3/P). Furthermore, if 0 € Gal(L/K), then (oS, L/K) = 
o(%,L/K)o"}. 
Proof. The first assertion is true by the definition of the Frobenius auto- 
morphism via the isomorphism Z7($3/P) = Gal(Eg/Fp). 

To prove the second assertion, recall that for 0 € G we have cOg = Ogg. 
Thus, (o58, L/K) is characterized by 


(of, L/K) ow =(ow)X? (mod of) Vw Ee Og. 


Applying o~! to both sides of this congruence we deduce that o~1(of, 
D/K)o = (8, L/K) from which the result follows immediately. 


From the second part of this proposition and Proposition 9.2, we see that 
as $3 varies over the primes above P in L, the Frobenius automorphisms 
(%, L/K) fill out a conjugacy class in G. This leads to the following formal 
definition. 


Definition. Let L/K be a Galois extension of global function fields, and 
P a prime of K which is unramified in L. The Artin conjugacy class of P, 
(P,L/K), is defined as the set of all Frobenius automorphisms (%, L/K) 


as 58 varies over the primes in L above P. 


The map from Sx to the conjugacy classes of Gal(Z/K) given by P > 
(P,L/K) is called the Artin map. It is extremely important and we shall 
discuss it in some detail. First, however, we will record some more “func- 
torial” properties of the Frobenius automorphism. 


Proposition 9.11. Let L/K be a Galois extension of global function fields 
and M an arbitrary intermediate field. Let % be a prime of L and p and 
P the primes lying below it in M and K, respectively. Assume %/P is 
unramified. Then, 


(Pp, L/K)i°/") = (p, L/M) . 
If M/K is also a Galois extension, then 
(B, L/K)|m = (p, M/k) . 
Proof. Using the characterization 
(P,L/K)w=w'" (mod P) Ww € Og , 
we deduce 


(PB, L/K)F/P)y swNPT"™ (mod P) Vw e Og . 
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Since NP£(P/P) — Np, this also characterizes (p, L/M). This proves the 
first assertion. 
To prove the second, just recall O, = Og 1M and p= 8M. Thus, 


(%,L/K)ws=w? (modp) WweO,. 


This characterizes the automorphism (p, M/K) as well, so (8, L/K)|m = 
(p, M/K), which finishes the proof. 


One of the main goals of this chapter is to investigate the set of primes 
P of K which go to a fixed conjugacy class C in Gal(L/K) via the Artin 
map P > (P,L/K). One way to describe the abundance of such primes is 
via the notion of Dirichlet density. We introduced this notion in a special 
case in Chapter 4. The next task is to give the general definition and to 
investigate its properties. 

Let M C Sx be aset of primes in K. The Dirichlet density of M, 6(M), 
is given by the following limit, provided that the limit exists. If it doesn’t 
exist we say that M does not have Dirichlet density. 


The expression “s — 1*” means that s approaches 1 through real values 
from above. 

It is clear from this definition that when 6(M) exists we have 
0< 6(M) <1. 

The denominator in this definition can be replaced with either log ¢x(s) 
or — log(1—s). To see this, consider the following calculation where through- 
out we assume #(s) > 1, and sums over P mean sums over all P in Sx. 


log ¢x(s) = S— 3 k* NP“ =S°NP* +S 3 k-INP-Fs , 
P k=1 P P k=2 


Let’s call the second sum Rx(s). We claim Rx(s) remains bounded as 
s— 17. To see this, set x = #(s) and note that 


IRx(s)|< S> 3 NP-** =S°NP~**(1—NP~*)"* 
P k=2 P 


<2S°NP~® < 2¢x (22) . 
P 


Since ¢x(s) is holomorphic for R(s) > 1, we see that ¢x% (2s) is bounded in 
a neighborhood of 1, which establishes our claim. 

Next, by Theorem 5.9, ¢x(s) has a simple pole at s = 1. Thus, 
lim,_.1+(s — 1)¢x(s) = ax # 0. Confining s to a small neighborhood of 1 
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we see that log((s — 1)¢x«(s)) = log(s — 1) + log ¢x(s) is bounded. Thus, we 
have shown 
SNP ~ log ¢x(s) * —log(s —1). 
P 

where f(s) © g(s) means that f(s) — g(s) is bounded in a neighborhood of 
1 (in particular, on an interval of the form (1,7r)). It follows that all three 
functions tend to infinity as s —> 1* and, also, that the ratio of any two 
tend to 1 as s — 1*. So, we have justified the claim that we could have used 
any of the three functions as the denominator in the definition of Dirichlet 
density. The function — log(s — 1) is particularly useful for some purposes. 
It does not depend on K! 

Certain properties of Dirichlet density follow easily from the definition. 
We have already mentioned one of them. We summarize those which will 
be needed later. 


Proposition 9.12. Let K be a global function field and M C Sx a set 
of primes. If the Dirichlet density of M exists, 0 < 6(M) < 1. If M 
is finite, 6(M) = 0. Also, 6(SxK) = 1. Suppose Mj, and Mog both have 
Dirichlet density. If these two sets differ by only finitely many primes, then 
6(M}) = 6(Moa). If M, GC Mo then 6(My;) < 6(Mo). If My A Mo = ¢, 
the empty set, then 6(M, U Mga) = 6(Mi1) + 6(Mg). 


The property involving disjoint unions extends to finitely many sets, 
but not to denumerably many sets! For each P € Sx let {P} be the set 
consisting of one element, P. Then, 6({P}) = 0 for every P, Sk = Up{P}, 
but 6(Sx) = 140. One must not think of Dirichlet density as a measure 
(in the technical sense) on the set of primes of K. 

We have enough information to prove an important special case of the 
Tchebotarev density theorem, and we proceed to do so. For much of the 
rest of this chapter we will fix a global function field K as base field and 
consider a finite Galois extension L of K. Given such an extension, we 
define {L} C Sx to be the set of primes in K which split completely in L. 
By our previous work, this can be characterized as the set of primes P of 
K, which are unramified in L and for which (P, L/K) = (e), the conjugacy 
class of Gal(L/K) consisting of the identity element. 


Proposition 9.13. Let L/K be a Galois extension of global function fields. 
The Dirichlet density of the set of primes in K which split in L is given 
by O({L}) = 1/[L: K]. If Ly and Lz are two Galois extensions of K and 
{Ii} = {Lo}, then Dy = Do. 


Proof. We consider the zeta function of L. We have, 
logéz(s)= S> SoktINp = SNP H +S OS RNB 
PEesp k=1 p B k=2 


The double sum is what we previously labeled Rz(s). This was shown 
to be bounded in a neighborhood of s = 1. In the sum that remains, group 
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the terms lying over a fixed prime P in K and we get 


logéx(s)= S” SO NBS. 


PESK P|P 


We can ignore the finitely many ramified primes. Set [LZ : K] =n. For the 
remaining primes we have f(/P)g(/P) =n. Thus, 


SE eee Dy > ae 


P $|P f\n PESK 
f(B/P)=f 


where we have used NY = NP/‘B/P), The sum of the terms with f > 1 
is bounded in a neighborhood of 1 and the sum of the terms with f = 1 is 
exactly nope {L} NP~%. Putting all this together we find 


log¢r(s) #(L:K] S> NP. 
Pe{L} 


Finally, divide both sides by —log(s — 1) and take the limit as s > 1+. We 
conclude that 1 = [L : K]6({L}) and so 6({L}) = [L: K]~* 

To prove the second part of the Proposition, consider the compositum 
L = LL». By Proposition 9.9, a prime splits completely in L if and only if 
it splits completely in L, and Ly. Thus, {L} = {L,}n {Lo} = {Li} = {Lo}. 
From the first part of the proposition we conclude that [Z : K] = [£1 : K| = 
[D2 : K]. Since Ly C L and Lz C L we have Li = L = Ly. The proot is 


complete. 


We note that to get the second part of the Proposition it would have 
been enough to assume {L;} and {2} differ by at most a set of Dirichlet 
density zero. This generalization is sometimes quite useful. 

We are now in a position to state the two different forms of the Tcheb- 
otarev density theorem that we have promised. 


Theorem 9.13A. (Tchebotarev Density Theorem, first version). Let L/K 
be a Galois extension of global function fields and set G = Gal(L/K). Let 
CCG be a conjugacy class in G and Si, be the set of primes of K which 
are unramified in L. Then 


#C 
#G ° 


In particular, every conjugacy class C' is of the form (P, L/K) for infinitely 
many primes P in K. 


b({P € Sk | (P,L/K) = C}) = 


Theorem 9.13B. (Tchebotarev Density Theorem, second version). Let 
L/K be a geometric, Galois extension of global function fields and set 
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G = Gal(L/K). Let C C G be a conjugacy class. Suppose the common 

constant field F of K and L has q elements. As above let Sy be the set of 

primes of K unramified in L. Then, for each positive integer N, we have 
AC g' (<-) 


#{P € Sk | deg P=N, (P,L/K) =C}= 755 +0( = - 


In particular, for every sufficiently large integer N, there is a prime P of 


degree N with (P,L/K)=C. 


In the second theorem, the hypothesis that the extension be geometric is 
not absolutely necessary, but it simplifies both the statement and the proof. 
The interested reader can investigate how matters should be modified to 
handle the general case. 

It will be seen that both of these theorems have considerable depth. 
However, the second is much stronger and much more difficult to prove. 

In fact, we will not prove either theorem completely, but will reduce both 
theorems to facts about Artin L-series. This may not be the easiest way 
to proceed, but is, perhaps, the most instructive. M. Deuring was able to 
prove the number field version of the first theorem by reducing to the case 
where Gal(L/K) is abelian by means of a very clever trick. The reader may 
wish to adapt this proof to function fields. See Lang [5], Ch. VIII, Theorem 
10, for an exposition of Deuring’s proof. 

Of course, before we can go forward along the lines indicated toward a 
proof of either theorem, we have to define Artin L-functions and discuss 
their properties. So, we do this first, and afterwards sketch the proofs. 

Let G = Gal(L/K) be the Galois group of a Galois extension of global 
function fields and p : G — Autc(V) a representation of G. Here V is a 
finite-dimensional vector space over the complex numbers C of dimension 
m. By choosing a basis of V over C we are led to an isomorphism Autc(V) = 
GLm(C). Thus, for o € G we can think of p(o) either as an automorphism 
of V or an m X m matrix with complex coefficients. The latter way of 
looking at things is more concrete, but depends on the choice of a basis. 
However, our definitions will only depend on the determinant and trace of 
such a matrix and these only depend on the automorphism. 

Let P be a prime of K which is unramified in L and let $B be a prime of 
L lying above it. We define the local factor L p(s, p) as follows: 


Lp(s,p) = det (I — p((8,L/K))NP~*) . 


Here, I is the identity automorphism on V and (8, L/K) is the Frobe- 
nius automorphism at %. By Proposition 9.10, we easily verify that the 
definition is independent of the choice of $8 above P. 

Let {a1(P),a2(P),--+ ,@m(P)} be the eigenvalues of p(($8, L/K)). Again, 
Proposition 9.10 can be used to show that these eigenvalues depend only 
on P and not the choice of 8 above P. In terms of these eigenvalues, we 
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get the following useful expression for the local factor at P: 
Lp(s,p)~* = (1 —a1(P)NP~)(1 — ag(P)NP~8)--+(1—am(P)NP~S) . 


We remark that since ($8, L/K) has finite order, these eigenvalues are 
roots of unity ( f($8/P)-th roots of unity, to be precise). 

Next, we must answer the question of what should be the local factors 
at P in the case when P is ramified in L. Let $8 lie above P and set 
Z = Z(/P) and I = I(98/P) (the context should keep this use of “J” 
separate from its use as the identity automorphism). We recall the exact 
sequence: 


(e) > I> Z > Gal(Eg /Fp) - (e) . 


Let V' = {v € V | p(r)v = v, Vr € I}. This is a vector subspace of 
V. Let yp be any element in Z which maps to ¢p € Gal(Eg/Fp), the 
automorphism defined by raising to the NP power. We define the local 
factor at P by 


Lp(s, p) = det (I’ — p(yp)|v:NP78) 


Here, I’ is the identity automorphism on V‘. Since any two choices of yy 
differ by an element in J, the definition of Lp(s, p) is unaffected. As before, 
the definition is also unaffected by the choice of $B lying above P. 

Let m’ be the dimension of V’ and {a1(P),ae(P),-+: ,Qm/(P)} the 
eigenvalues of p(y). These, indeed, depend only on P, and we have 


Lp(s,p)~* = (1 — a1 (P)NP7*)(1 — a2(P)NP7S) +++ (1 — am (P)NP™®) . 


We remark that m’ < m and, once again, the eigenvalues are all roots of 
unity. 

Having defined the local factors for all P € Sx we now define the Artin 
L-series associated to the representation p by the equation 


L(s, p) = l] Lp(s,p) - 


PESK 


Suppose p = Po, the trivial representation. This means that V is one 
dimensional and p,(c) is the identity for all o € G. It follows easily from 
the definitions that L(s, p,) = ¢x(s). 

Another interesting representation of G is the regular representation Preg: 
In this case V = C[G], the group ring of G over C and for all o € G, preg (o) 
is given by left multiplication by o. It can be shown that L(s, preg) = C1 (s). 
We will return to these matters later. 

Suppose (V,) and (V’, p’) are isomorphic representations. This means 
there is an isomorphism p : V - V’, such that for allvu € V anda €G 
we have uu(p(a)(v)) = p'(o)(u(v)). It is easy to see that if p and p’ belong 
to isomorphic representations, then L(s,) = L(s,p’). It follows that the 
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[-series depends only on the character x of the representation. Recall that 
if (V, p) is a representation, the corresponding character is given by x(a) = 
trace(p(a)) for alla € G. y is a complex-valued function on G. It is a class 
function in the sense that y(r~!or) = x(c) for all o,r € G. It is easily 
seen that two isomorphic representations have the same character. We will 
write L(s,p) = L(s,x). 


Lemma 9.14. Let (V,p) be a representation of G = Gal(L/K) where 
L/K is a Galois extension of global function fields. Let P be a prime of K 
unramified in L. Then 


log Lp(s, x) “ra —_ ; 


where x(P*) means x((%8, L/K)*) for a prime lying over P. 


Proof. If {a;(P), a@2(P),--+ ,Qm(P)} are the eigenvalues of (%, [/K), then 
we showed earlier that Lp(s,y)~* = [];~,(1 — ai(P)NP75). Taking the 
logarithm of both sides and using the identity —log(1—X) = 072. k71X*, 
we find 


3 ie Oy P k 1 
k=1 


The sum )>"", a;(P)* is equal to the trace of p((B, L/K)*), which is x(P*) 
by definition. 


The reader may wish to give a similar expression for log Dp(s,~) when 
P is ramified (see Artin [2] or Lang [5], Chapter XII). 

Up to now we have been treating everything in a formal manner and not 
worrying about where these new L-series are defined. It is relatively easy 
to provide some information by using the comparison test. 


Proposition 9.15. With the above notation and conventions, L(s, x) con- 
verges absolutely in the region R(s) > 1 and for every 6 > 0 it converges 
absolutely and uniformly in the region R(s) > 1+6. Consequently, L(s, x) 
1s holomorphic and non-vanishing for all s with R(s) > 1. 


Proof. An infinite product [[”~ ,(1+a,) converges absolutely if and only 
if 77-1 |@n| converges. Using the local decomposition Lp(s) = [Ji ,(1 — 
a;(P)NP~*)~!, we can use this criterion together with the fact that the 
zeta function ¢x(s) = [[p(1—NP*)~' converges absolutely in the region 
¥(s) > 1 to show the same holds for L(s, x). Since each of the local factors, 
Lp(s,x), is non-vanishing in that region, the same holds for the product, 
L(s,x). 

The statement about uniform convergence can be proved in a similar 
fashion. 
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To prove the Tchebotarev Density Theorem we need more information 
on the analytic properties of Artin L-series. We present the next two the- 
orems in parallel with the two versions of the Tchebotarev Theorem, i.e., 
Theorems 9.13A and 9.13B. 


Theorem 9.16A. (E. Artin) Let L/K be a Galois extension of global fields 
(either function fields or number fields) and L(s,x) a corresponding Artin 
L-series. Then, for some positive integer n, L(s,x)" has a meromorphic 
continuation to the whole complex plane. Moreover, if x is a non-trivial 
irreducible character, then L(s, x)” is holomorphic and non-vanishing in a 
neighborhood of s = 1. 


Theorem 9.16B. (A. Weil) Let L/K be a geometric, Galois extension of 

global function fields. Denote by q the number of elements in the constant 

field. Let L(s,x) be a corresponding Artin L-series and assume that x is 

irreducible and non-trivial. Then L(s,x) is a polynomial in q~*. In partic- 

ular, this implies that L(s,x) has a holomorphic continuation to the whole 
s 


complex plane. Moreover, denoting by m the degree of L(s,x) in q™*, we 
have 


L(s,x) =][Q—mlda’) , 


i=l 
where for each i with 1 <i<m, |m(x)| = /@. 


We will not prove either of these results. In the case of Artin’s theorem, 
we will show later how to reduce the proof to the case of one-dimensional 
characters and how, via Artin’s reciprocity law, the result can be made 
to follow from Hecke’s work on another type of L-series. Our main goal 
is to show how to use Theorem 9.16A to prove Theorem 9.13A and how 
to use ‘Theorem 9.16B to prove Theorem 9.13B. First, however, a series 
of remarks. These remarks are not needed in the proofs, so the impatient 
reader can simply skip over them. 


1. Artin deduced Theorem 9.16A by means of a theorem on group char- 
acters. Namely, he showed that any complex character of a finite group G 
can be written as a rational linear combination of induced characters from 
cyclic subgroups of G. See Serre [3] for the definition of induced character 
and the proof of this theorem (Chapter 9). From this it follows that there 
is an integer n > 0 such that L(s, x)” can be written as a product of Artin 
L-series corresponding to one-dimensional characters divided by another 
such product. Since, via Hecke’s work and the reciprocity law, he knew the 
result to be true for one-dimensional characters, the meromorphic contin- 
uation follows. Using the same ideas he deduced L(1,v) #4 0 by reducing 
to the case of one-dimensional characters. 


2. Strictly speaking, Theorem 9.16A only gives information about L(s, y)” 
about s = 1, not L(s, x) itself. However, the result implies that on any real 
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interval (1,t), L(s, x) is bounded from above and is bounded away from 0. 
This is all we need to prove Theorem 13A. 


3. In 1947 , R. Brauer proved a much stronger theorem on group char- 
acters. Namely, if x is any complex character on a finite group G, then x 
can be written as a Z-linear combination of characters induced from one- 
dimensional characters on “elementary subgroups” (again, see Serre [3], 
Chapter 10, for the definitions and proof). This sufficed to show that any 
Artin [-series has a meromorphic continuation to the whole complex plane, 
i.e., the troublesome “n” in Artin’s theorem can be taken to be 1. Brauer’s 
result did not give Artin’s conjecture that L(s,x) has a holomorphic con- 
tinuation to the whole complex plane when x is irreducible and non-trivial. 
In the number field case, this remains an open conjecture. In the function 
field case, Weil proved it in the precise form given by Theorem 9.16B, using 
algebraic-geometric methods. 


4. In Theorem 9.16B, once the first part of the theorem has been estab- 
lished, the second part, about the size of the inverse roots, follows from 
an important, but not deep, property of Artin L-series and the Riemann 
hypothesis for function fields. 

Let G be a finite group and YXreg the character of the regular repre- 
sentation described earlier. Let {x1,x2,--- ,xXg} be the set of irreducible 
characters of G. We set x1 = Xo, the trivial character. Denote by d; the 
degree of x;, i-e., d; = yi(e) = the dimension of the representation space 
corresponding to y;. In this language, the one-dimensional characters are 
those of degree 1. It is well known that yreg = >7_, dix; (See Serre [3], 
Chapter 2). 

To avoid awkward notation, we consider a geometric, Galois extension 
of function fields M/K (not L/K for now). Let G = Gal(M/K). Then, 
using the result about group characters given in the last paragraph, formal 
properties of Artin L-series, and L(s, Xreg) = Ci(s), one deduces 


g 


Cm (8) = ¢x(s) [] £6, xi)” 


i=2 


Assuming the first part of Weil’s result, set, for 2 <i < g, L(s,xi) = 
P(u, xi), @ polynomial in u = q~*’. Now use Theorem 5.9, which describes 
the form of the zeta function of a global function field. Substituting into 
the last equation, we get 


By the Riemann hypothesis for global function fields (see Theorem 5.10), 
and the fact that M/K is a geometric extension, the inverse roots of Lyy(u) 
all have size \/q. The right-hand side of the above equation is a product of 
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polynomials, so each of these polynomials must have inverse roots whose 
absolute value is \/q. 


5. Weil was able to determine the exact degree m,; of the polynomials 
P(u, x:). The answer is that for 2 <i <g, m; = di(2gx —2)+deg, F(x). 
Here, F'(x;) is an effective divisor of K called the Artin conductor of the 
character y;. We will not define it here, but the interested reader can consult 
Serre (2|, Chapter VI. We will give the definition later in the special case 
where x is a linear character. By considering the last relation in Remark 4 
above, and taking degrees, we find 


| g 
29m —2= |M: K](2gxK —2)+ S di deg F (xi), 
i=2 


a relation which is the function field analogue of the conductor-discriminant 
theorem of algebraic number theory. (We have used [M : K] = ><%_, d? 
which follows from X;eg = an d;x; by evaluating both sides at the iden- 
tity element e). 


We now return to our main business. 


Lemma 9.17. Let G be a finite group and C C G a conjugacy class of G. 
Leto EC andt €G. Then 
2x) (7) =0ifréCand #E if Tec, 
EC 


where the sum is over all irreducible characters of G. 


Proof. This is one of the two standard othogonality relations among char- 
acters of finite groups. See Lang [4] or Serre [3). 


We have all the tools we need to give a proof of the first form of the 
Tchebotarev Density Theorem. 


Proof of Theorem 9.13A. Let x be any irreducible character and define 
L*(s,x) = [I pe si. Lp(s,x). We have omitted the finitely many factors 
from the product defining L(s, .) which correspond to primes of K ramified 
in L. It is still true that L*(s, 0) has a simple pole at s = 1 (since it differs 
from ¢x(s) by a factor which is holomorphic and non-vanishing at s = 1) 
and that L*(s, x) is bounded and bounded away from 0 on any real interval 
of the form (1,t) if x # xq is irreducible. This follows from Theorem 9.16A. 
By Lemma 9.14, we have, for R(s) > 1 


log L*(s,x) = S- log Lp(s, x) = S- > PS 


PES‘, PES; = 
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For any element 7 € G, we have |x(7)| < d, the degree of y. This follows 
since x(7T) is the sum of d roots of unity. From this, we see 


» Ee = dy > PE dRxK (x) . 


PES), k= PES 


Just as in a previous discussion, after the definition of Dirichlet density, 
one shows Rx(xr) < 26K (2x). Since this is bounded in a neighborhood of 1 


we deduce (P) 
log L*(s, x) ~ S- Tae . 
PES, 


Choose an element o € C and multiply both sides by y(o) and add the 
result over all irreducible characters y of G. Making use of Lemma 9.17, 
we obtain 


—— , #G 1 
S_ x(c) log L*(s, x) © EG , NPs” (*) 
x P, (P,L/K)=C 
Since L*(s,x.) has a simple pole at s = 1 and for the other irreducible 
characters, L*(s, y) is bounded and bounded away from zero on any interval 


of the form (1,¢) we have 


log L"(s,X0) _ and lim log L*(s, x) 


= 0 O° 
sit —log(s — 1) s—1+ —log(s — 1) Jorx#x 


In equation (*) above, divide both sides by —log(s — 1) and take the 
limit as s + 17. The result is 


_#G 
#C 


which concludes the proof of the theorem. 


l= F5 6UP € &, | (P,L/K) = C}) , 


The exact same proof works equally well in algebraic number fields. The 
reader will not fail to notice how similar this proof is to the proof of Dirich- 
let’s theorem, Theorem 4.7. We shall indicate below how the two results 
are connected. In fact, Theorem 9.16A should be thought of as a vast gen- 
eralization of Dirichlet’s theorem. First, however, it is time to prove the 
second form of Tchebotarev’s density theorem. 


Proof of Theorem 9.13B. We begin by making a remark about Lemma 
9.14. We proved it assuming P is a prime of K unramified in L. We would 
like to take the ramified primes into account as well. Using the definition 
of the local factor of an Artin L-series at a ramified prime we found in this 


case also one can write Lp(s, x)~! = 1.1 —a;(P)NP~*) where the a; 
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are roots of unity and m’ < m = the degree of y. We define x(P*) to be 


ae a®, This coincides with the definition in case P is unramified and 
with this definition the formula of Lemma 9.14 is valid for all primes. 

We next calculate the logarithmic derivative of D(s,x) assuming 
R(s) > 1. From the Euler factor definition, L(s,x) = [[pes, Lp(s,x); 
we find, using Lemma 9.14, that 


oe) x¥ Pk 
los Lax) = Yo Ae 
PESK k=1 


We now switch to the variable u = q~* and, by abuse of notation, write 
L(s,x) = L(u, x). The above relation becomes 


log L(u, x) _ S- 3 X(P*) deg P 


k 
PESK k=1 


Take the derivative of both sides and multiply the resulting equation by 
u. We find 


L' U, oo ; [e @) a 
es = S- S "deg P x(P*)uk dee P — Sen (x)u . 
“X) PES, k=1 n=l 
The coefficient c,(xv) of u” is given by 
cx) = So degP x(PY es?) () 


P, deg P|n 


We write this as 


cr(x)=n( SY) xX(P)) + Ralx) (2) 


P, deg P=n 


and we will show later that R,(x) = O(q"/?). 

The main idea of the proof is to express c,(x) in another way using the 
zeros and poles of the various L-series whose size we know something about 
because of the Riemann hypothesis for function fields and Theorem 9.16B 
above. From this it will turn out that 


en(x) = 9"6(x,X0) + O(g"/?) , (3) 


where 6(y, Xo) = 1 if v = xo and is 0 otherwise. 

Assuming these facts about c,(x) we will now show how to complete 
the proof. Afterwards we will give the details behind these two separate 
evaluations of c,(x). 

Combining equations 2 and 3, we find 


q5(XsX0) FO(G"’?) =n( SJ x(P)) +09") . (4) 
P, deg P=n 
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There are only finitely many ramified primes in D/K so for all n sufficiently 
large there are no ramified primes of degree n. Thus, from some point on 
x(P) = x((P, L/K)) for all primes of degree n. We assume n is at least this 
big. Now, choose an element o € C' and multiply both sides of Equation 4 
by x(a) and sum over all irreducible characters yy. Using Lemma 9.17, we 
deduce 


gq’? +O(q"/?) =n #GC wep E Sx | deg P =n, (P,L/K) =C}+O(q"””) . 


#C 
Divide both sides of the equation by n#G/#C, combine the error terms, 
and, subject to the proofs of 2 and 3, the theorem follows. 
We now proceed to show the validity of the two expressions we have given 
for cn (x). Consider first Equation 2. From Equation 1 we get the following 
explicit expression for R,(x): 


Ri(x)= > degP x(Prise?) | 
deg Pin 
deg P< n 


If h is the degree of x, then, as we have seen, |y(7)| < h for all rT € G. So, 
taking absolute values and using the triangle inequality, we get 


Ra(x)i<h SD daa(K). 


d|n,d<n 


Recall that ag(K) is the number of primes of K of degree d. We know that 
dian €aa( KX) = Np(K). It follows that 


[Rn (x)| S h|Nn(K) — nan(K)] . (5) 


By the analogue of the prime number theorem, Theorem 5.12, we know 
nay, = q” + O(g"/?). Since N,(K) = q*™+1- S079, m7, where for each 4 
we have |m;| = ,/g, we also have N,,(K) = g” + O(q"/"). It follows that 
|N,(K)—nan(K)| = O(q"/*). The required estimate R,(x) = O(q"/) now 
follows from Equation 5. 

The final step is to prove the estimate for c,(x) in Equation 3. We begin 
with the trivial character y,. As we have seen, the Artin L-series for xq is 
just the zeta function of K. Thus, 


Lx (u) 
(1 —u)(1 — qu)’ 
where Lx(u) = []{2,(1 — mu) and for each i, || = /q. Taking the loga- 


rithmic derivative of both sides, multiplying by u and equating coefficients, 
we find 


L(s,Xo) = 


29 
Cn(Xo) =q" +1—- Son? =? +O(q™?) . 


w=l1 
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This verifies the estimate for cn (x .). We have done this calculation much 
earlier, in the proof of Theorem 5.12, and we have used the result in a 
different context in the last paragraph. 

If y # x is irreducible, then by Weil’s result, Theorem 9.16B, we can 
write L(u, x) = [];2,(1—-7(x)u) where m is the degree of L(u, x) and each 
T(x) has absolute value ,/q. Taking logarithmic derivatives, multiplying 
by u, and comparing coefficients we derive 


Cn(X) =~ da mi(x)” . 


From this it is clear that c,(x) = O(q”/?). 
Theorem 9.13B is proved. 


We have been content to be somewhat careless about the error term. It 
can be estimated effectively by keeping careful track of constants at each 
step of the proof. The interested reader can try working this out or he/she 
can consult Murty and Scherk [1]. 

The method of proof is often used in analytic number theory. We have an 
arithmetic L-series which is defined by an Euler product over primes. One 
then tries to continue the function to be analytic on the whole complex 
plane. One then writes the same function as a product over its zeros and, 
when they exist, poles. Taking the logarithmic derivative of both product 
expansions and comparing the results usually leads to important results. 
This idea goes back to Riemann. It has been a very fruitful method. 


For the rest of the chapter we will treat the case of abelian extensions of 
global function fields. For the most part we will be content to sketch this 
beautiful theory, but from time to time complete proofs will be supplied. 
Our main objective is to set out the connection between Artin L-series 
associated to abelian extensions and Hecke L-series (to be defined below). 
This is fundamental to any deeper understanding of the material we have 
covered up to now. 

From now on we assume that L/K is a finite, Galois extension of global 
function fields and that the Galois group, G = Gal(L/K), is an abelian 
group. As before, E will be the constant field of L and F the constant field 
of K. We set q = #F and m = [E: F]. If P is a prime of K and $B; and 
‘85 are two unramified primes of L lying above P, then by Proposition 9.10 
the two Frobenius automorphisms (81, L/k) and (Bo, L/K) are conjugate 
in G. Since we are assuming G is abelian, these two automorphisms are 
equal. Thus, the conjugacy class (P, L/K) contains only one element. We 
identify this conjugacy class consisting of one element with an element of 
G. The automorphism (P, [/K) is called the Artin automorphism at P. Let 
Si C SK denote the set of primes in K which are unramified in L. Then, 
P + (P,L/K) is a well-defined map from S; to G. Let Dy C Dx be the 
divisors of K whose support lies in S,. Then, by linearity, P > (P, L/K) 
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extends to a homomorphism from Di, — G which is called the Artin map, 
(x, L/K). To be explicit, if D € Dy, then 


(D,L/K) =( )< a(P)P,L/K)= |] (P,L/K) . 


PES\, PES 


Proposition 9.18. The Artin map (*,L/K): Dy — G is onto and 
the kernel contains the group Nr/;xKD{, where Dy is the subgroup of Di 
generated by primes of L unramified over K. 


Proof. Let G’ denote the image of (*,L/K) and M Cc L the fixed field 
of G’. If P € S; then, by Proposition 9.11, (P,L/K)|m = (P,M/K). By 
definition, (P,L/K)|y = e. Thus, (P,M/K) = e which implies that P 
splits completely in M. Since S; has Dirichlet density 1, it follows from 
Theorem 9.13 that M = K. Galois theory now yields that G’ = G, ie, 
(x, L/K) is onto. 

If B is a prime of L lying above P, then by the definition of the norm map 
(see the discussion following Proposition 7.6) we have Nr/« PB = f(PB/P)P. 
Thus, 

(Nik P,L/K) = (P,L/K)IP =e. 


The last equality is a consequence of Proposition 9.10 which asserts that the 
Frobenius automorphism (8, Z/K) (and so (P,L/K)) has order f(B/P). 


The second assertion of the proposition follows from this. 


The exact nature of the kernel of the Artin map is a very difficult ques- 
tion. We first turn our attention to a much simpler question. Among abelian 
extensions of K the simplest are the constant field extensions. How does 
the general theory play out in this special case? The key to answering this 
question is to determine explicitly the Artin automorphism (P, KE/K). 

Recall that Gal(E/F) is cyclic of order m generated by the automorphism 
@q which maps a — a® for all a € E. We have shown previously that 
Gal(KE/K) = Gal(E/F). From now on we identify these two groups. 


Proposition 9.19. Let L = KE where E is an extension of F of degree 
m. Let P be any prime of K. Then (P,L/K) = peeen P 


Proof. Every prime of K is unramified in L since L is a constant field 
extension. See Proposition 8.5. 

Suppose a € E. From the definition, (P,L/K)a =a” (mod $), where 
Bis a prime of L above P. Both sides of this congruence are in E and thus 
the difference is in EN B = (0). It follows that 


(P,L/K)a=aN? VackE. 


Now, NP = q%8%«?. It follows that the right-hand side of the above 


equality coincides with pycex (a). Since a € E is arbitrary, we deduce 


(P,L/K) = ¢9°8*”. 
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Proposition 9.20. Maintaining the notation of the previous proposition, 
the Artin map (*,L/K): DK — Gal(L/K) = Gal(E/F) ts onto and the 
kernel is the group DE. Dy. Here, D&, denotes the group of divisors of degree 
zero. 


Proof. We already know that the map is onto. To determine the kernel 
we note that the Artin map is given by (D,L/K) = pyreK ° for D € Dr. 
This is true for prime divisors by the previous proposition and it follows in 
general by linearity. From this we see D%, is in the kernel. Since Gal(L/K) 
has order m, it follows that DZ is also in the kernel. Thus, D&D¥ is in 
the kernel. This group is equal to the kernel since it has index m in Dr. 
This follows since DZ, DF is the kernel of the map 


D-~ deg, D (modm) from Dx > Z/mZ. 


We note, for future reference, that Px, the principal divisors of K, have 
degree zero and are thus in the kernel of the Artin map for constant field 
extensions. 

We can now determine the Artin L-functions associated to constant field 
extensions. 


Proposition 9.21. Again maintaining the notations and hypotheses of 
Proposition 9.19, let x be an irreducible character of G = Gal(L/K). Then, 
L(s,x) = ZK (x(¢q)u) where, as usual, u=q7*, and ZK (u) = Cx(s). 


Proof. Since G is abelian, x is a linear character, i.e., a homomorphism 
from G to C*. From the definitions, and Proposition 9.19, 


L(s,x) = |] G@-x((P,L/K))NP~*)"* 
PESK 
_ l] (1 _ V(b 9) 28K Pa-s deg x Py-1 
PESxK 
= TT A ~ x(eq)uy®*?)t = Zi (x(Gq)u) 
PESK 


This proposition gives a meromorphic continuation of L(s,x) to the 
whole plane, which is good. However, all of these functions have poles, 
which seems to be bad. 

This result seems troubling at first sight. If x = x, the trivial character, 
the Artin L-function, is the zeta function, which is as it should be. If y is not 
trivial, then L(s, y) is not a polynomial in u and in fact has poles (at s € C 
such that y(¢,)qg~* = q_! or 1). This seems to contradict Artin’s conjecture 
and Weil’s theorem. It does not contradict Weil’s theorem, Theorem 9.16B, 
because part of the hypothesis was that L/K be a geometric extension, 
i.e, L and K have the same constant fields. Artin’s conjecture has to be 
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modified in the function field case to accommodate characters belonging to 
constant field extensions. We will explain this more fully later. 

It is worth noting that a consequence of Proposition 9.21 is that for 
constant field extensions L(1,x7) # 0 if y is linear and non-trivial. This 
follows because 


L(1,x) = Zx(x(bq)q7*) #0, 


since Cx(s) has no zeros on the line #R(s) = 1 by Proposition 5.13, and so 
Zx(u) has no zeros on the circle |u| = qu}. 

Another interesting consequence, which is a special case of a far more 
general result (which we discussed in Remark 4 following the statement of 


Theorem 9.16B) is that 


Cr(s) — [[£6.» ) 


where the product is over all linear characters of G. This is immediate from 
Theorem 8.15 and Proposition 9.21. 


Having investigated constant field extensions the next question is to see 
how they fit into the more general situation. Let L/K be a general abelian 
extension with E the constant field of L and F the constant field of K. 
Then KE is the maximal constant field extension of K in L. L/KE is a 
geometric function field extension. 


Proposition 9.22. Let L/K be an abelian extension of global function 
fields and KE be the maximal constant field extension of K in L. Let G = 
Gal(L/K) and G' the image of Dig under the Artin map. Then, G’ = 
Gal(L/KE). In particular, if L/K is a geometric extension, Di2 maps onto 
G under the Artin map. 


Proof. Let P be a prime of K which is unramified in ZL. By Proposition 
9.11 and Proposition 9.19 we see (P,L/K)|kg = (P, KE/K) = pacen 
By linearity, if D € Di,, then (D,L/K)\xr = o°8K It follows that Dir 
maps to Gal(L/KE), i.e., G’ C Gal(L/ KE). 

To show the Artin map from D?? to Gal(L/KE) is onto is a little tricky. 
We first need a subsidiary result, namely, that m = [KE : K] is the greatest 
common divisor of the degrees of the primes in {LZ}, the primes of K which 
split completely in L. Note first that if P € {L} then (P, L/K) =e. Thus, 
(P, KE/K) = e and this occurs if and only if m|degy P. Let m’ be the 
greatest common divisor of the degrees of primes in {ZL}. We have just 
shown m|m’, and we want to show m = m’. To do this, consider the finite 
field IE’ > E whose degree over F is m’. Every prime in {L} splits completely 
in KE’ since the degree of any such prime is divisible by m’. The field LE’ 
is a Galois extension of K since it is the composite of two Galois extensions 
of K, L and KE’. A moment’s reflection shows that {ZL} = {LE’}. By 
Proposition 9.13, this implies that L = LE’, ie., E’ c L. This shows that 
EK’ = E and it follows that m = m’. 
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From what we have proven, it follows that there are primes P), P2,::- , 
P, € {L} and integers a1, a2,:-+ ,a; € Z such that 


t 
Sa degy Pi; =m. 
=] 


Set C = S\\_, aiP;. Then, C € Dy and degy C = m. Also, (C,L/K) =e 
since every prime in the support of C' splits completely in L. 

To finish the proof, choose o € Gal(L/KE). By Proposition 9.18, 0 = 
(D,L/K) for some D € Di. Since a is the identity on KE it follows from 
e= 0|lke = (D, KE/K) = pyen ? that m| deg; D. Suppose degy D = 
km with k € Z. Then D —kC has degree zero, and 


(D—kC,L/K) =(D,L/K)(C,L/K)~“* =ce“* =o. 
The proof is complete! 


Let S' be a finite set of primes in a global function field K and F = 
> peg h(P)P an effective divisor of K with support in S. We define the 
ray modulo F, P*, to be the set of principal divisors of K generated by 
elements «x € K* which satisfy 


ordp(x—1)>hA(P) VP ES. 


Clearly, the ray modulo F is a subgroup of the group of principal divisors 
Px. In fact, it is a subgroup of P(S), the principal divisors of K whose 
support is disjoint from S. Let D(S) C Dx be the group of divisors whose 
support is disjoint from S. 

The ray class group modulo F, Cl, is defined to be the quotient D(S)/P* . 
This group is not finite. However, there is an exact sequence 


(0) > ClE > Cle — Z— (0) 


induced by the degree map. It can be shown, using the finiteness of the 
divisor class group of degree zero, that Cl% is a finite group. The first step 
in the proof (which we shall not pursue) is to show the following sequence 
is exact: | 


(0) > P(S)/P? > CIZ 4 CI& = (0) , 


where the map from Cl% to ClX% is as follows: given a ray class in Cl, 
find a divisor representing it and map that divisor to its class in Cl%. We 
know that this latter group is finite, so it all comes down to showing that 
P(S)/P* is finite. This is not difficult. 

The relevance of these notions comes from the following theorem, which 
is one form of the Artin reciprocity law. 
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Theorem 9.23. (E. Artin) Let L/K be a finite abelian extension of global 
function fields. Let S be the set of primes’ of K which are ramified in L. 
Then the Artin map, (*,L/K), takes D(S) onto Gal(L/K) and there is 
an effective divisor F supported on S such that the kernel of the map is 
P? Nr KD. 


As we have already mentioned, this is a very deep result whose proof is 
long and involved. We have proved a portion of the Theorem in Proposition 
9.18. The Artin map is onto and the norms of divisors are contained in the 
kernel. The exisitence of a divisor F such that P* N;, /K Dy, is the kernel is 
the hard part. We will not prove it here, but, accepting its truth, we will 
derive some consequences. 

Notice that another way to state the same thing is that the Artin map 
takes the ray class group Clz onto Gal(L/F) and the kernel is generated 
by the classes of the norms of unramified primes in L. With minor modifi- 
cations, the same result holds in algebraic number fields. What is required 
in this case is some attention to the archimedean primes. These do not 
exist in function fields. 

How unique is the divisor F which plays such a major role in the Theo- 
rem? It turns out it is not unique. However, one can show that in the set of 
all effective divisors with the same property as F there is a minimum one 
(recall that one divisor is greater than or equal to another if their difference 
is effective or zero). This minimum divisor, which we continue to denote 
by F, is called the conductor of L/K. Sometimes one writes this as F;, /K- 

Our next goal is to define Hecke L-series and then, using Artin’s theorem, 
connect these to Artin L-series. 

Let F be an effective divisor with support S C Sx. A character of finite 
order on Cl¢ is called a Hecke character modulo F. There is a more general 
notion of Hecke character which is very important, but we will confine our 
attention to those which satisfy the definition just given. 

Another way to phrase the definition is to say a Hecke character modulo 
F is a homomorphism from D(S) — C* whose kernel is a subgroup of finite 
index containing the ray P7. 

Let A be a Hecke character modulo F in the sense just given. We want 
to define a Hecke L-series, L(s, X). The definition suggests itself. Let P ¢ S 
be a prime of K. Define A(P) to be A evaluated on the ray class in Clr 
containing P. Then, define 


L(s,A)= [] Q-XA(P)NP)?. 
P¢S 


Since |A(P)| = 1, one sees easily by the comparison test that L(s, ) con- 
verges absolutely for R(s) > 1 and for every 6 > 0 it converges absolutely 
and uniformly in the region R(s) > 1+. It follows that Hecke L-series are 
entire function of s in the region #(s) > 1. Since the terms of the product 
are non-vanishing in that region, the same is true for L(s, A). The following 
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result is essentially due to Hecke. He proved the analogous result in the case 
of algebraic number fields. The details of the function field version were first 
worked out by F.K. Schmidt. Nowadays, one can give a uniform proof of 
both versions simultaneously (and for the most general Hecke characters). 
This was done by J. Tate [2] in his thesis. A more classical approach to the 
function field case can be found in Deuring [1] and Moreno [1]. We will give 
the proof of a special case below (Proposition 9.26). 


Theorem 9.24. Let X be a Hecke character modulo F and assume that » 
is not trivial on D°(S). Then L(s, A) 1s an entire function of s. In fact, it 
is a polynomial in q~*. Moreover, L(1,) 4 0. 


This is actually a rough version of the full result which includes a beau- 
tiful functional equation that is satisfied when the character 4 is primitive. 
We will briefly explain what this means and write down the functional 
equation. 

Suppose F’ < F are two effective divisors. It is easy to see that there is 
a natural map 7: Cle > Cle. If \’ is a character of Cle then A = ’ om 
is a character of Clr. X is said to be induced from X’. 

A character 4 modulo F is said to be primitive if it is not induced from 
a character of any properly smaller modulus. In this case F is said to be 
the conductor of A and we write F = Fy. 


Theorem 9.24A. Let X be a primitive Hecke character with conductor Fy 
and suppose A is not trivial on D°(S). Then L(s, A) is a polynomial in q-° 
of degree 2g —2+ deg, Fy. Define A(s, A) = g9- D8 NFA L(s, A). Then 


A(s, A) = €(A)A(1 — 8,d) , 
where €(A) is a complex number of absolute value 1. 


We are finally in a position to explain why, for linear (one dimensional) 
characters, Artin L-series are “the same as” Hecke L-series. 

Let L/K be a finite, abelian extension of global function fields, G = 
Gal(L/K), and x a linear character on G. We want to show that the Artin 
L-series L(s,x) is equal to a Hecke L-series. As a first step, let Ny C G 
be the kernel of x and let K, Cc L be the fixed field of N,. It is almost 
immediate that Gal(K,,/K) is cyclic of order equal to the order of x in the 
character group of G. We set Gal(K,,/K) = G,, and note that x gives rise 
to a character on G, = G/N,. We call this character x as well. Let F, 
be the conductor of the extension K,/K. By Artin’s theorem, Theorem 
9.23, the Artin map, (*, K,/K), gives a homomorphism from Clz, onto 
Gal(K,/K). Call this homomorphism p and set \ = yo p. Then X is a 
homomorphism from Clz, to C*, i.e., a Hecke character modulo F,. It can 
be shown that A is a primitive character modulo Fy, i.e., Fy = Fy. With 
all this in place, it now follows directly from the definitions that 


D(s,x) = L(s,A) . 
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This is the long awaited identification of an Artin L-series associated to a 
linear character with a Hecke L-series. We can now invoke Theorem 9.24 to 
establish the analytic continuation of L(s, x) to the whole complex plane. 
It remains to explain when this continuation is entire. 


Theorem 9.25. Let L/K be a finite, abelian extension of global function 
fields, G = Gal(L/K), and xy a linear character of G. Then, L(s,x) has 
an analytic continuation to an entire function in the whole complex plane 
if and only if K,/K is not a constant field extension. 


Proof. From the discussion preceding the statement of the theorem, 
L(s,x) = L(s,A) where \ = xo p. Here, p: Clr, 4 Gy and x: G, > C*. 
Let G, be the image of Clr in G,. By Proposition 9.22, the fixed field of 
G‘, is the maximal constant field extension of K inside of Ky. 

Since p is onto A is not trivial on Cle if and only if y is not trivial on 
G).. Since x is one to one on Gy we see x is trivial on G\ if and only if G, 
is trivial and this happens if and only if K, is a constant field extension. 
Equivalently, x is not trivial on Gj if and only if Ky /K is not a constant 
field extension. 

Thus, if A,/ is not a constant field extension, A is not trivial on Clr 
and by Theorem 9.24 this shows L(s,A) = L(s,x) is entire. If K,/K is 
a constant field extension, Proposition 9.21 shows L(s, x) is meromorphic, 
but not entire. 


Theorem 9.25 gives a precise understanding of when L(s, x) fails to sat- 
isfy Artin’s conjecture in the function field case. 

In the exercises we will outline the relationship of Hecke L-series to the 
Dirichlet L-series, which were introduced and investigated in Chapter 4. As 
it turns out, the latter are simply a special case of the former. 

We will conclude this chapter by proving a portion of Theorem 9.24, 
namely for those Hecke characters belonging to the trivial modulus. In this 
case, the ray is just the group of principal ideals, Px, and the ray class 
group is Dx /PK = Cl, the class group of K. So, we will consider L- 
series attached to characters of finite order of the class group of K. In this 
case, the analytic continuation and the functional equation follow from the 
Riemann-Roch theorem by using the same ideas that went into the proof 
of ‘Theorem 5.9. 


Proposition 9.26. (special case of Theorem 9.24A) Let X be a character of 
finite order of Clz and suppose that X is not trivial on Cl. Then, L(s, X) 
is a polynomial in q~° of degree 2g —2. Set A(s,A) = q'9~))8L(s, A). Then, 
A(s,X) = A(C)A(1 — s,), where C is the canonical class of K. 


Proof. L(s, A) = [[p(l1—A(P) NP78)7? = 30, A(A)NA75 where the prod- 


uct is over all primes of K and the sum is over all effective divisors of K. 
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Summing by degrees we find 


OO 


L(s,\)=S>{ S- XA)] ao 


We claim )iaeg,, A=k (A) = 9 for all k > 2g — 2. This will show that 
L(s, x) is a polynomial in q~* of degree at most 2g — 2. 

Assume k > 2g — 2 and let {Aj, Ao,-:--,An} be a set of divisors of 
degree k representing the divisor classes of degree k. Here, h = hx is the 
class number of K. For any two divisors B; and Bp» we will use the notation 
B, ~ By to mean that B,; and Bo are linearly equivalent, i.e., B; — Bo is 
principal. All sums will be over effective divisors A. We have 


h k—-gt1 4 
Sy v4)= ( S sa) - PAY) | 


degy A=k ti=1 \AnWA; i=1 


We have used two facts. Since \ takes principal divisors to 1, A ~ A; 
implies A(A) = X(A;). Secondly, if k > 2g — 2, the number of effective 
divisors linearly equivalent to A; is (¢*~9t! — 1)/(q—1). This follows from 
Lemma 5.7 and the fact that 1(A;) = k—g+1 since degy Aj = k > 2g —2 
(see Theorem 5.4, Corollary 4). 

Let D be a divisor of K of degree 1. Such a divisor exists by the theorem 
of F.K. Schmidt. Write A;-kD = B; for each i with 1 <7 < h. The divisors 
{B,, Bo,--- , By} have degree zero and, in fact, are a set of representatives 
for the divisor classes of degree zero. Substituting A; = kD + B; in the 
above sum, we see that 


k-g+l1 _ h 

\(A) = 1" py S7 AB) | 
deg, A=k q i=1 
The latter sum is zero, since it is the sum of the character evaluated on 
all the elements of the group Cl?%, and by hypothesis, A is not trivial on 
that group. This completes the first part of the proof. 

To prove the functional equation for L(s, A) we first ease the notation by 
setting u = q~* and writing D(s,) = f(u, A) . We have shown that f(u, 4) 
is a polynomial in u of degree at most 2g — 2. Now, 


L(s,A)=f(urAy= Yo AA)usee« 4 


deg yp A<2g—2 


WA) _y 
= 3 A(A) q 1 y tee A . 
deg x A<2g-2 q~ 
The first sum is over effective divisors A and the second sum is over divisor 


classes A. The passage from the first sum to the second uses Lemma 5.7 
once again. 
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A simplification occurs because 


\MA)uteex A= 0. 
deg, A<2g—2 


To see this, simply sum by degrees and check that each coefficient of the 
resulting polynomial is 0 because the sum of A evaluated on all divisor 
classes of a fixed degree is 0 (one reduces to the case of divisor classes of 
degree zero, as above). We are thus led to the following simple expression 


(q—1)f(u,A) = S- NA)g Aydeex 4 . 


deg, A<2g—-2 


Multiply both sides by u~9 = q'9~1)* and we get 


(q —1)u!~9f(u, A) = S- MA)gAydeea A-9+1 


deg xp A<2g—2 


The key observation is that if C denotes the canonical class, the map 
A —- C— <A is a permutation of the divisor classes of degree less than or 
equal to 2g — 2. Thus, in the last summation we can substitute C — A for A 
and the sum remains the same. Let’s investigate how the individual terms 
change. ee 

The expression \(A) becomes A(C — A) = X(C)A(A)7+ = XA(C)A(A). 

The expression q'‘4) becomes q!(°-4) = g9—!-de8x Ag'(A) | since, by the 
Riemann-Roch theorem, [(A) = degry A-~—g+1+l1(C — A). 

Finally, u?eéx 4-9+! becomes u9~1~4¢8x 4 since deg C = 2g — 2. 

Making all these substitutions in the above equation yields 


(q- L)u' 9 f(u, A) = A(C) S_ NA) gq!) (q7tu71)se8« A-g+l _ 
deg pp A<2g—2 


(Q— 1)AC)qriu*) 9 Ff (qr tu", A) . 

If we let F(u,A) = u’9f(u,r), we have shown that F(u,A) = 
MC) F(q7tu71, A). Since F(u, A) = q9- YS L(s, A) = A(s, A), the functional 
equation we have proven for F’(u, A) translates into the functional equation 
for A(s, A) given in the statement of the Proposition. 

It remains to prove that L(s, A) is a polynomial in q~* of degree 2g — 2. 

Rewriting the functional equation for f(u, A) we derive 


u~@9~2) Fu, d) = A(C)q* f(q7 tue, A) . 


The constant term of f(u, A) is 1 (this is immediate from the definition), 
Thus, the right-hand side tends to A(C)q9~' as u — oo. It follows that 
f(u,A) has degree 2g — 2 and also that the coefficient of the leading term 
is \(C)q9—'. Translating back to “s” language shows that L(s, ) is a poly- 
nomial in q~* of degree 2g — 2, as asserted. 
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Exercises 


1. 


Let L/K be a Galois extension of function fields with Galois group 
G. Let Dr be the different divisor. Show that oDz,/~% = Dz/x for 
alla EG. 


. Let L/K be a Galois extension of function fields with Galois group 


G. Suppose there is a prime P of K which is inert in L; i.e., there is 
a prime $8 in L, lying above P, such that f(%/P) = [L: K]. Show 
that G is cyclic. 


. (Continuation). Conversely, if G is cyclic, show that there exist in- 


finitely many primes P in K which are inert in L. What is the density 
of this set of primes? 


. Suppose that E/K is a geometric and separable extension of function 


fields. Let L be the smallest Galois extension of K containing L. 
Show by example that the constant field of L may be larger than the 
constant field of K. 


. Let Ey and FE» be two finite Galois extensions of a function field K. 


Suppose that there is a prime P of K which is totally ramified in 
&, and unramified in HE. Show that Fy E2/E2 is Galois with group 
isomorphic to Gal(F,/K) and that every prime in £2 lying above P 
is totally ramified in Fy Eo. 


. Let L/K be a Galois extension of function fields with Galois group G. 


Let N be a normal subgroup of G and L’ the fixed field of N. Let % 
be a prime of L and P the prime of K lying below 9%. If J(8/P) CN, 
show that P is unramified in L’. If 7(8/P) C N, show that P splits 
completely in L’. 


. Suppose L/K is a Galois extension of function fields and that 2 


is a prime of L. If a € L*, and o € Gal(L/K), show ord, (a) = 
ordy(o~*a). In particular, if % is fixed by Gal(L/K), then for any 
a € L*, all the conjugates of a have the same order at $B. 


. Let L/K be a Galois extension of function fields with Galois group 


G. Let $8 be a prime of L and P the prime of K lying below it. We 
assume that G = Z(B/P) (if this isn’t true, simply replace K by the 
fixed field of 7(8/P)). Define subsets of G as follows: Gm = {o € 
G | ordy(ca—a) > m+1,Va € Og}. Show that these sets are normal 
subgroups of G. Note that G_; = G = Z($8/P) and Go = I(8/P). 


. (Continuation) We wish to study the structure of Go = I(8/P). 


We can replace K with the fixed field of [(9$8/P). Once this is done, 
we can assume ‘$8 is totally ramified over P. If II is a uniformizing 
parameter at 5B, it can be shown that Og is free as a module over Op 
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12. 


13. 


14. 


15. 


16. 


17. 
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with a basis {1, II, II?,...,I[®°~'}. Using this, show for each m > 0 
that Gn = {o € Go | ordg(oII — II) > m+ 1}. Also show that 
Gm = (e) for all sufficiently large integers m. 


(Continuation) Define U(™ = {u € Ox | ordp(u—1) > m}. For 
m > 0 we define maps pm : Gm/Gm+1 Ulm) m+) by sending 
o € Gm to the residue class of oII/TH in UO™/U'"+), Show that pm 
is independent of the choice of uniformizing parameter. Show that 
Pm is a homomorphism and that it is one to one. 


(Continuation) Let Fay = Og /. Show that there is a monomorphism 
from U/U“) to Ff and monomorphisms from U (m) /U(™+)) to Fry 
for all m > 1. Deduce that Go/G, is cyclic of order prime to p=char 
F, and that G, is a p-group, in fact, the unique p-Sylow subgroup of 


I(B/P). 


Let L be a function field over a constant field F’ and suppose that 
go is an automorphism of L which is the identity on F’. Let 8 be a 
prime of L. Then, o induces a continuous map from O,-1y — Og 
which extends to an isomorphism (which we continue to call o) from 
Lo-19 > Dg. Define ¢ : Ay > Az by G(aq) = (by) where by = 
Oa,-i9 for all primes $ of L. Show that @ is a ring automorphism 
of A; and that its restriction to L is a. 


Let L/K be a finite separable extension of function fields. In Chapter 
7, we defined a trace map trz;~% : A, — Ax. We now define a map 
in the other direction, i,;~% : Ax — Az sending the adele (ap) to the 
adele (bs) whose §B-th coordinate is ap for every prime $B lying over 
P. Show that i,/% is a one-to-one ring homomorphism which sends 


K to L. 


(Continuation) Show that trz;% °1,/K is multiplication by n = [L: 
K]. If n is not divisible by the characteristic of L, conclude that tr; 
is onto. 


(Continuation) Suppose L/K is a Galois extension of function fields 
with Galois group G. Show that the adeles of L which are fixed by 
G, AG, are equal to in/KAK. 


Let L/K be a Galois extension of function fields and o an auto- 
morphism of L which leaves the constant field fixed. Ifw € Dy, isa 
differential, define ow by ow(a) = w(o~1a) for all a € Ay. If w van- 
ishes on Az(D) for a divisor D, show that ow vanishes on Az(oD). 
Use this to prove that ow is a differential. If (w) is the divisor of w 
show that the divisor of gw is a(w). 


(Continuation) Let O denote the zero divisor and 2x(O) the space 
of holomorphic differentials. Show o maps 2.,(O) into itself. Assume 


18. 
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that g > 2, where g is the genus of L. Also assume that o is non- 
trivial and of finite order prime to the characteristic of L. Show that 
the action of o on 2;,(O) is non-trivial. (Hint: Let K be the fixed field 
of o. Ifo acts trivially on 2;(O), show the map w > w* = wotry/x is 
an isomorphism between 0% (O) and 2;(O) . Conclude that gr, = gx. 
This contradicts the Riemann-Hurwitz formula). The proof outlined 
here is due to R. Accola. 


Let k = F(T) be the rational function field and A = F[T] be the ring of 
polynomials. Let m € A, m ¢ F*, and suppose m = aP;'! P;?... Pi" 
is its prime decomposition. Each P; corresponds to a prime 9B; of k. 
Let M = 5° a,9; be the effective divisor of k corresponding to m. We 
set M.. = M + oo. Show that Cl, & (A/mA)* and deduce that a 
ray class character modulo M,., restricted to the divisors of degree 
zero, is the same as a Dirichlet character modulo m. 


10 


Artin’s Primitive Root Conjecture 


By now we have developed a lot of foundational material about the arith- 
metic of function fields. In this chapter we will put this material to work 
and give the beautiful proof, due to H. Bilharz, of E. Artin’s conjecture 
about primitive roots in function fields. 

The work we will describe is the PhD thesis of Bilharz, who wrote the 
thesis under the direction of H. Hasse. His paper appeared in 1937 (see 
Bilharz [1]). 

Bilharz dates the origin of the conjecture very precisely. He claims Artin 
made his conjecture in a private conversation with Hasse which took place 
on September 12, 1927. Artin considered an integer a € Z which is not in 
the set {0,1,—1} . Let M, be the set of primes, not dividing a, for which 
a is a primitive root. Does this set have a Dirichlet density and if so can a 
formula be found for it? On heuristic grounds, Artin conjectured that the 
density was 


(Ma) = T]0- @—p) TG - 7 ): 
1¢Sa 1ESa 


where the first product is over all primes for which a is not an /-th power in 
Q and the second over the finitely many primes (maybe the empty set) for 
which a is an /-th power. The first product is convergent and, since all the 
terms are non-zero, so is the product. The second term is zero if and only 
if 2 € Sy, i.e. a is a square in Q. Thus, assuming this formula is correct, 
it follows that if a is not 0,41, and not a square, then there are infinitely 
many primes for which a is a primitive root. 
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Artin’s formula is not correct as it stands, but it may be modified slightly 
to give what is believed to be the correct result . The qualitative conse- 
quence described above remains unaffected by this. It is this latter state- 
ment that is known as Artin’s conjecture on primitive roots. It remains 
open to this day. In 1967, C. Hooley gave a conditional proof of the conjec- 
ture, with the correct formula for the Dirichtlet density, by assuming the 
truth of the generalized Riemann hypothesis for a certain set of algebraic 
number fields (Hooley [1]). Even without the Riemann hypothesis, great 
progress has been made in recent years by R. Gupta, M. Ram Murty, and 
D.R. Heath-Brown (see the survey article of M. Ram Murty [1)). 

The thesis problem of Bilharz was to formulate the primitive root con- 
jecture in global function fields and give a proof of it in this context. He 
did this brilliantly except that his proof was conditional on the truth of the 
Riemann hypothesis for global function fields. In 1948, Weil published his 
proof of this result and one consequence was that the Artin conjecture on 
primitive roots was no longer a conjecture, but a theorem, in the function 
field context. 

Let us fix a global function field K with constant field F having q ele- 
ments. Let a € K* and P € Sx, a prime of K which is prime to a. We 
say that a is a primitive root modulo P if its residue class in (Op/P)* has 
order NP — 1, i.e. it is a cyclic generator of (Op/P)*. If a € F* its order 
divides g — 1 and thus a can be a primitive root only for the finitely many 
primes of degree 1. We assume from now on that a € K*, but not in F*. 
The following simple lemma is crucial to what follows. 


Lemma 10.1. Let P be a prime of K not containing a € K*. Then, a is 
a primitive root modulo P if and only if there is no prime 1 € Z satisfying 
both of the following conditions: 


1) NP=1 (mod 1) and 14) aT =1 (mod P) . 


Proof. If there is a prime / satisfying both conditions, then the order of a 
modulo P divides (NP — 1)/l, so that a cannot be a primitive root. So, if 
a is a primitive root, there is no prime / for which both conditions hold. 

Now, suppose there is no prime | for which both conditions hold and let 
h be the order of a modulo P. We claim that h = NP — 1. If not, there is 
a prime ! dividing (NP — 1)/h. In this case, h divides (NP — 1)/l and so 
both conditions of the lemma are satisfied, which is a contradiction. Thus, 
h = NP —1 and a isa primitive root modulo P. 


We can assume from now on that | 4 p since condition 7) of the lemma 
never holds for the characteristic p of F. For each prime ! 4 p in Z let ¢; be 
a primitive l-th root of unity in a fixed algebraic closure of K. We define 
an extension K, of K to be the field obtained by adjoining ¢ and any I-th 
root of a, \/a, to K, ie, K; = K(G, Wa). Since K, is the splitting field of 
the separable polynomial X! — a over K, K, is a Galois extension of K. 
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We will show in a while that both conditions of Lemma 9.1 are satisfied if 
and only if P splits completely in K,. This will tie our discussion in with 
some of the material developed in the last chapter. Before doing so we have 
to take a short detour to discuss cyclotomic and Kummer extensions in 
function fields. 


Proposition 10.2. Let L = K(¢)). Then (L: K| = f(l), the smallest post- 
tive integer f such that qi =1 (mod 1). A prime P € Sx splits completely 
in L if and only if NP =1 (mod 1). 


Proof. Since L = KF(¢,) it is a constant field extension and [LD : K] = 
[F(¢,) : F] by Proposition 8.1. Now, Gal(F(¢;) : F) is generated by ¢, the 


f 
automorphism that takes an element to its q—th power. Thus, 6! (¢;) = Gi. 


It follows that gf fixes ¢; if and only if gf =1 (mod 1). It follows that the 
order of ¢; is f(l) since ¢; generates F(¢;) over F. This proves [L : K] = f (I). 

Since L/K is a constant field extension, every prime of K is unramified 
in Z by Proposition 8.5. If $8 is a prime of L above P, then by definition 
(P,L/K)q = GNP (mod $8). Both sides of this congruence are constants, 
so we must have equality. Thus, 


(P,L/K)(G) = Gh” , 


and it follows that (P, L/K) is the identity on F(¢;) if and only if NP =1 
(mod J). This proves the second assertion of the proposition. 


A more elementary proof of the second part of the proposition can be 
obtained by using the corollary to Proposition 8.13. 

Our next task is to investigate extensions of the type K(./a)/K. These 
are called Kummer extensions. We want to know which primes split com- 
pletely, which primes ramify, and also a formula for the genus of K(,/a) if 
we know the genus of K. The answers to these questions are given in the 
next three propositions. 


Proposition 10.3. Let K be a function field over a constant field F of 
characteristic p. Let | be a prime number not equal to p anda € K*, not 
an l-th power in K*. Let a be a root of X'-a=0 and L = K(a). A prime 
P of K is ramified in L if and only if | does not divide ordp(a). If P is 
ramified, it is totally ramified, i.e., there is only one prime B above it in L 


and e(B/P) = 1. 


Proof. Note to begin with that since 1 4 p, X'—a is a separable polynomial 
and so L is a separable extension of K. Also, since a is not an l-th power, 
z' — a is irreducible (see Lang [4]) and so [L: K] =. 

Suppose first that llordp(a). Let 7 be a uniformizing parameter in the 
valuation ring of P, Op. Then, a = 7'"u, where h € Z and u is a unit in 
Op. Thus, a/7" = yw is an 1 —th root of u and L = K(a) = K(). Let Rp 
be the integral closure of Op in L. We claim {1, p, ?,--+ ,u'~'} is a basis 
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of Rp over Op. It is certainly a basis of L/K since X' — u is irreducible. 
Suppose 2 € Rp. Then 


1-1 
B= So cu with GE K. 
i=0 
By the usual argument, the coefficients c; will be in Op if we can show 
that the determinant of the matrix (trz/«('y’)) is a unit in Op. We claim 
tr(w) = 0 unless I|h, in which case the answer is lu’*/!. This is because pu” 
is a root of X! — wu", which is irreducible when wu” is not an J-th power, i.e., 
when h is not divisible by I. If l|h, then »” = u"/" and the result is clear. 
We now have enough information to show easily 


det(trz/x(u'p?)) = 4U'ul? ” 


This is a unit in Op since | # p and wu is a unit in Op. This shows, 
simultaneously, that {1,,--- ,u'~'} is an integral basis for Rp/Op and 
that the discriminant 0p,/0, = Op. From Proposition 7.9, we conclude 
that P is unramified in L. 

Now suppose that | does not divide ordp(a). Let $8 be a prime above P 
in L. Since a! = a, we have 


l ordgy(a) = ordy(a) = e(B/P)ordp(a) . 


This implies that l|e($8/P). By Proposition 7.1, e(8/P) < l. This shows 
that e(8/P) =I and so P is totally ramified as claimed. 


Before stating the next proposition we pause to give a somewhat technical 
definition which will be useful here and later. Let K/F be a function field 
with the constant field of characteristic p, possibly zero. An element a € F* 
is said to be geometric at a prime | # p if K(./a) is a geometric field 
extension of K; i.e., the constant field of K(./a) is F. Here Wa is some root 
of X' —a = 0 in an extension of K. The definition does not depend on 
which root is chosen (in a given algebraic closure of K). It is an exercise to 
show that a is geometric at | unless it has the form pb! where p € F* — F*! 
and 6 € K*. One way is clear. If a has this form, then K(\/a) = K(./) 


which is certainly a constant field extension. 


Proposition 10.4. Let K/F be a function field, with constant field F of 
characteristic p (possibly, p = 0). Let | 4 p be a prime and L = K(a) 
where a! =a € K*. Assume that a is geometric at 1 and that a is not an 
l-th power in K*. Then, 29, —2 =I(2gx —2)+ Ra(l—1) where R, is the 
sum of the degrees of the finitely many primes P of K where ordp(a) is 
not divisible by l. 


Proof. This is an application of the Riemann-Hurwitz Theorem, Theorem 
7.16, which asserts that 


29, —2= |L: K](2Qgn — 2)+deg, Dijk . 
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Here, L/K is presumed to be a finite, separable, geometric extension of 
function fields and D;/x% is the different divisor of the extension. ‘These 
hypotheses apply to L = K(a), given our assumptions. If $B is a ramified 
prime of L lying above P in K, by Proposition 10.3 we must have e(§8/P) = 
1, f(B/P) =1, and g(P/P) = 1. 

We have to figure out the quantities on the right-hand side of the equa- 
tion. We already know [LJ : K] = 1 since a is not an /-th power in K. 
Since p 4 1, by Corollary 2 to Lemma 7.10, the coefficient of a ramified 
prime $8 in the different is e(B/P) —1=1—-—1. Since § is totally ramified, 
f(B/P) = 1 and so deg, B = deg, P. Thus, the degree of the different 
is just (J — 1) times the sum of the K-degrees of the primes P of K with 
ordp(a) not divisible by / (again using Proposition 10.3). This sum is Ra, 
by definition, so the proof is complete. 


It is worthwhile noticing that if one fixes the base field K and an element 
a € K* satisfying the hypotheses of Proposition 10.4, then, as / varies over 
the prime numbers, the genus of K(,/a) is a linear function of l. This 
observation will be of use later. 


An interesting special case is to take K = F(T), the rational func- 
tional field, f(T) € FT] a square-free polynomial of degree N, and L = 
K(4/f(T)) (assuming that char (F’) 4 2). A calculation, using the proposi- 
tion (and not forgetting the prime at infinity) yields the fact that the genus 
of L is (N —1)/2 if N is odd and (N/2) —1 if N is even. 


Proposition 10.5. Let K be a global function field over a constant field F 
with q elements. Let 1 be a prime different from the characteristic of F. Let 
a € K*. Assume that K contains a primitive l-th root of unity, ¢,. Suppose 
that P is a prime of K and that ordp(a) = 0. Then, P splits completely in 
L= K(/a) if and only if 


NP-1 


a '™ =1 (mod P). 


Proof. Since ¢; € K, the extension L/K is a cyclic Galois extension. Also, 
we must have ¢, € F*, which implies g = 1 (mod /) andso NP = 1 (mod 1) 
for all primes P of K. 

P is unramified in L if ordp(a) = 0 by Proposition 10.3. Thus, the 
Artin automorphism (P, L/K) is defined. The order of this automorphism 
is f(8/P) where $ is any prime of L lying over P. Thus, P splits completely 
if and only if (P, L/K) is the identity automorphism. 

Any two roots of X! — a =0 differ by an J-th root of unity. Let a be any 
root of this equation. Then (P, L/K)a is another root. Thus (P, L/K)a/a 
is an /-th root of unity which is easily seen to depend only on a and not 
on a, The usual notation for this l-root of unity is (a/P),, the [-th power 
residue symbol. We have 


(a/P)ja =(P,L/K)a=aN? (mod §) , 
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where B is any prime of L lying above P. By hypothesis a ¢ P, so we can 
divide this congruence by a to obtain 


(a/P))=aNP l= qT (mod 3B) . 


Since ¢; € K we can conclude that 
(a/P), = a (mod P) . 


Thus, if P splits completely, (P,L/K) is the identity, which implies 
(a/P), = 1 and the above congruence shows aNP-1/! = 1 (mod P). Con- 
versely, if aN?-1!/' = 1 (mod P), then (a/P); = 1 (mod P). Since both 
sides are constants, they must be equal; i.e., (a/P),; = 1. This implies 
(P, L/K) is the identity ( since a generates L) and so, P splits completely. 


Notice that the conclusion is true even if a is an l-th power in K*. Of 
course, everything is trivial in this case. 


Proposition 10.6. Let K be a global function field with constant field F. 
Let | be a prime different from the characteristic of F. Leta € K*. Let 
Ek, = K(q, Wa). Let P be a prime of K such that ordp(a) = 0. Then, P 
splits completely in L if and only if 


NP=1 (modl) and a tT =1 (mod P). 


Proof. Consider the tower of fields K C K(¢;) C E;. A prime of K splits 
completely in L if and only if it splits completely in K(¢,) and every prime 
above it in K(¢;) splits completely in £). 

By Proposition 10.2, P splits in K(¢,) iff NP =1 (mod l). Let ® bea 
prime of K(¢) lying above P. We apply Proposition 10.5 to EB) /K(G); i-e., 
in that proposition we replace K by K(¢,) and L by £). It follows that 
splits completely in &; if and only if 

qt =1 (mod $8) . 
Since both sides of this congruence are in Op, we may replace the modulus 
with P. Also, if P splits in K(¢;), then N58 = NP; so the condition is 


qt =1 (mod P) . 


This completes the proof. 


We now return to the original problem about Artin’s primitive root con- 
jecture in the function field case. By combining Propositions 10.1 and 10.6 
we see that a is a primitive roots modulo a prime P if and only if ordp(a) = 
0 and P does not split completely in any of the fields E; = K(G, \/a), where 
/ runs through all the primes different from the characteristic of F. Let M, 
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denote the set of primes in K which do not split completely in any of the 
fields E,. M, differs from the set of primes for which a is a primitive root 
by at most the finitely many primes P with ordp(a) # 0. In particular, if 
one of the sets has Dirichlet density, both do, and the Dirichlet densities 
are equal. Bilharz’s proof proceeds by showing that the Dirichlet density of 
Ma, 6(Mq), exists and is non-zero with the exception of some very special 
circumstances for which a cannot be a primitive root for infinitely many 
primes. We will make the simplifying assumption that a is geometric. We 
remind the reader that this means that for all primes | # p, the field exten- 
sion K(./a)/K is geometric. If this assumption is not made, there is a gap 
in Bilharz’s proof of his theorem. This observation was also made by J. Yu, 
who has filled in the gap in a paper which is to appear, Yu [2]. By assuming 
a is geometric we avoid these difficulties and lose nothing essential about 
the original proof. 

The outline of the proof is clear and elegant, but the details are some- 
what complicated. We will begin by sketching the outline of the proof and, 
afterwards, go back and fill in the details. 

For any field extension L/K, recall that {LZ} denotes the set of primes 
of K which split completely in L. Note that {kK} = Sx in our previous 
notation. 

We will need the following key result. 


Proposition 10.7. Assume a is geometric and not an l-th power in K. 
Let m be a square-free integer prime to q and E,, the compositum of the 
fields E, for all l|m. Let f(m) denote the order of q modulo m. Then, 
[Em : K] = maf(m), where m, is the product of the primes | dividing 
m for which a is not an l-th power. The Dirichlet density of the set of 
primes which do not split completely in any of the fields Ey, with I|m is 


given by 
p(d) 
Do afd) 


Proof. The field £,, contains ¢,, a primitive m-th root of unity. 

Let [|m. We claim a is an I[-th power in K if and only if it is an /-th power 
in K(¢m). One way is obvious, so suppose a is an I-th power in K(¢,,). This 
implies that K(./a) C K(¢,). A subfield of a constant field extension is 
a constant field extension. Since, by assumption, a is geometric, this can 
only happen if K(./a) C K, i.e., when a is an /-th power in K. 

Clearly, E,, is the compositum of the field extensions K (Gm, W/a)/K (Gm) 
as | runs through those primes dividing m for which a is not an /-th power 
in K(¢n). By the last paragraph, these primes are the same as those for 
which a is not an /-th power in K. Since all these field extensions are cyclic 
of prime order, for distinct primes, we conclude that [En : K(¢m)] = Ma. 

To finish the proof that [E,, : K] = m,f(m) it remains to show that 
[K (Cm) : K] = f(m). Since [K(¢,,) : K] = [F(Gm) : F], this follows in the 
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usual way by computing the order of the finite field Frobenius automor- 
phism dg. 

To compute the density of the primes M(,) which do not split com- 
pletely in E,,, we resort to the inclusion-exclusion principle of set theory. 
Let {l,,l2,--- ,/:} be the primes dividing m. Applied to our situation, the 
inclusion-exclusion principle yields the following expression for the set in 
question: 


Mm) = {K} - Ute} + Utz.) N{E,,}—-++ ete 


For any subset T c {Kk} define 


— Lper NP 
(2s) = log Cx (s) 


It is clear that 6(T’,s) is well defined for s > 1. From our discussion of 
Dirichlet density in Chapter 9 (immediately after Proposition 9.11) we see 
that lim, _,;+ 6(T7,s) = 6(T). 

The above set theoretic-expression for M,,,) yields the following identity 
on the level of functions: 


b(Mim), 8) = 6({K}, 8) ~ 2, {Ei} 8) +) d({ Bn} {Bi, }, 8) - 


0,9 


Taking the limit as s + 1+, we see 


5(Mim)) =1— > _ 6({E1,}) + 5 6({Ei,} 9 {Ei }) — +++ ete. 
4 tJ 
In a finite set of Galois extensions the set of primes which split completely 
in the compositum is the intersection of the sets of primes which split 
completely in the individual extensions. Using this, the above expression, 
Theorem 9.13, and the computation of [E,, : K] in the first part of the 
proof yields 


we) 
Mn) = 2. daf(d) ’ 


as asserted. 


Let 1,,/2,13,--- be an enumeration of the primes different from p, the 
characteristic of F. Let mp, = Il2---+l, and let Em, be the compositum of 
the fields EB; with l|m,, ie. | € {l1,lo,+-+ ,ln}. Let M,, be the set of primes 
which do not split completely in any of the field E,, with 1 <i <n (in the 
notation of the above proof, Mn = Mim,)). Note that Mn D Mnii1 D Ma 
and that (),, M, = M,. We have just shown that 


_ p(d) 
M)= 2 EHO 


10. Artin’s Primitive Root Conjecture 157 


It is tempting to just pass to the limit as n — oo to get Bilharz’s Theorem 
(Theorem 10.19 below). In fact, that is how the theorem is proved, but the 
passage to the limit requires a rather elaborate justification. An important 
part of the justification is played by the following theorem, whose proof we 
postpone. 


Theorem 10.8. (Romanoff) Let q > 1 be an integer and for any integer 
m relatively prime to q, let f(m) be the order of q modula m. Then, the 


series 
Dar 
— mf (m) 
converges. The sum is over all square-free integers m relatively prime to q. 


We can now state the principal result of this chapter. 


Theorem 10.9. (Bilharz) Assume a is geometric element of K*. Then, 
with the above notations, the Dirichlet density of Mq exists and is given 


by 


(Ma)= DU om 
(mp)=1 


The sum is easily seen to be absolutely convergent using Romanoftf’s 
theorem. 

As we have already pointed out, Bilharz does not make the restriction 
that a be geometric. His proof seems to contain a small error involving 
the computation of the degree [E,, : K]|. This problem has recently been 
corrected by J. Yu, but we will be content with proving the theorem as 
stated. 

Before going on to the proof, we discuss the consequences of Bilharz’s 
theorem. The main difficulty is to determine when 6(M,) is zero and when 
it is not. The above expression for 6(M,) as sum does not immediately 
resolve this problem. One needs the following special case of a result of 
H. Heilbronn, whose proof we also postpone. 


Proposition 10.10. With the same notations as Theorem 10.9, 


» AT) a O-Gm) 


Let S denote the set of primes | 4 p for which a is an /-th power. S 
is a finite set, possibly empty. We can rewrite the right-hand side of the 
equation in Proposition 10.10 as follows: 


H (75) H0-3). 
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The first (infinite) product converges and is not zero since the individual 
factors are non-zero and the product converges, since dig g 1/lf(l) con- 
verges by Romanoff’s theorem. The second (finite) product is zero if and 
only if f(l1) = 1 for some prime / € S. In other words, the whole product 
is non-zero unless there is a prime / dividing gq — 1 for which a is an /-th 
power. This leads to the following theorem. 


Theorem 10.11. (Bilharz) Let a be a geometric element of K*. Then 
there are infinitely many primes P € Sx for which a is a primitive root 
provided that there is no prime divisor | of q—1 for which a is an I-th 
power. If there is such a prime divisor, then a is not a primitive root of 
any prime P € Sx. 


Proof. We have just shown on the basis of Theorem 10.9 and Proposition 
10.10 that if a is not an l-th power for some prime || q—1, then 6(M,) 4 0. 
Since we pointed out earlier that M, differs from the set of primes for which 
a is a primitive root by a finite set, it follows that the latter set has non-zero 
Dirichlet density and so must be infinite. 

If 1] g—1 and a = b! for some b € K*, then for any prime P of K not 
dividing a we have (using | g—1| NP —1) 


at =pNP-1=] (mod P) . 
Thus, a is not a primitive root for any prime P of K. 


We now turn to the proof of Theorem 10.9. We first show how to reduce 
the proof of the theorem to the proof that a certain infinite sum of functions 
converges uniformly. We then prove that assertion. This is the hardest part 
of the proof. After that we give a proof of Romanoff’s Theorem. Finally, 
we prove Heilbronn’s result, Proposition 10.10. 

Suppose a € K™* is a fixed geometric element. Recall that M,, is the set 
of primes in K which do not split in any field FE), for 1 <i <n. We note 
again that 


1) Mn DMnrii D Ma a) () Mn = Ma . 


n=1 
To these two properties we add a third: 
OO 
iti) Mn -Mac |) {Ei}. 
1=n+1 


This follows since any prime in M, which is not in M, must split com- 
pletely in E,, for some i > n. : 
From property 7) we see that for any s with s > 1 we have 


0(Mn, 8) > (Mri, 8) > 6(Ma,8) . 
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Thus, limn—+oo 6(Mn, 8) exists and is > 6(Mag, 8). 
We want to estimate 6(M,,s) — 6(Maga,s). To do this we first observe 
that by property iii we have 


To go further we need an observation and a key lemma. The observation 
is that for any Galois extension L of K and s which is real and greater 
than 1 we have 

1 log¢z(s) 
d{ Lh s)< wa 
Uh 8) [L : K] log ¢x(s) 


To justify this, one can go back to the proof of Theorem 9.13 and note 
that the proof shows log ¢,(s) = [L: K] >} pez; NP~* + R(s) where R(s) 
is positive when s is real and cer than 1. Dividing both ve of this 
equation by [L : K]|log¢x(s) proves what we want. 

Putting the last two inequalities together yields 


1 log Cx, (s) 
5(Mn,s) — 6( MoS > Ty Ey Bl Tortx(s) (1). 


We can now state the main lemma 


Lemma 10.12. There is a real number s; > 1 such that 


oe log Cx, (8) 
2 BK] 54 log Cic(s) 


converges uniformly on the interval (1, s;). 


Assuming this lemma we will conclude the proof of Theorem 10.9. We 
will then give the proof of the lemma. 

From Lemma 10.12 and Equation 1 we see that limn.o6(Mn,s) = 
6(M,, 5) on (1,s,) and that the convergence is uniform. We can use the 
following standard fact from real analysis. 


Fact. Let {fn(s)} be a sequence of functions on the interval (so, s1) which 
converges uniformly to a function f(s). For each n suppose lims_,, fn(s) = 
A, exists and that lim,» An = A exists. Then, lim,_,,, f(s) = A. In 
other words, 


lim lim fn(s) = lim lim f(s) . 


$—>89 N—-0CO N—-OCO S—>8O 


We now apply this fact to the sequence of functions 6(M,n, s). 


160 Michael Rosen 


lim 6(Ma,s) = lim lim 6(My,s) = lim lim 6(M,y,s) = 


s—y1+ s—y1+ n—- 00 00 s>1t 
(d) = p(m) 
FOO ys af (a) a Maf(m) 


In the next to the last equality we have used Proposition 10.7. The last 
equality uses Theorem 10.8, Romanoff’s Theorem. 

This sequence of equalities gives the proof of Bilharz’s Theorem, Theorem 
10.9, once we have proved Theorem 10.8, Proposition 10.10, and Lemma 
10.12. 

Still assuming the truth of Romanoff’s Theorem, we next tackle the proof 
of Lemma 10.12. 

Since a is an /-th power for only finitely many primes, to prove Lemma 
10.12 it suffices to prove that the sum 


] log Cz, (s) (s) 
> FO If(L) log ¢x(s) 2) 


is uniformly convergent on some interval (1,5), where the sum is over all 
primes / ¥ p for which a is not an /-th power. 

Let R = F(T) denote the rational function field over F and R; = F(¢;)(T) 
denote the rational function field with F(¢,) as constant field. It follows from 
Theorem 5.9 that 


CE, (3) — Ln, (s)Cr,(s), 


where Lg, (s) is a polynomial in q~/“* of degree 2g, where g; is the genus 
of E,. (Remember that f(l) = [F(¢,) : F].) Taking the logarithm of both 
sides of this relation and substituting into Equation 2 gives 


' 1 log¢z,(s) 1 aon 1 log¢r,(s) 
dF If (1) log Cx (s) LD Lf (2) log Ce(s) aor If(l) log CK (s) 8) 


It thus suffices to prove that these two sums are uniformly convergent on 
some interval (1, s,). We shall first prove this for the second sum. 
For s > 1 we note that 


1 1 


BS ETOH T= FOV § Tae SM) 
Thus, 
r log Cr, (s) 1 log Cr(s) 
> TF low cies) < De TAD fo cx) ) 


Since both Cr(s) and ¢x(s) have a simple pole at s = 1, it follows easily 
that the ratio log ¢r(s)/log ¢x(s) + 1 as s + 1+. Thus, there is an interval 
(1,81) such that this ratio is less than 2 for s € (1,81). It follows that 
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the right-hand sum in Equation 4 is dominated by the convergent sum 
kp OE This establishes the uniform convergence of the second sum in 


Equation 3 on the interval (1, s,). We now turn our attention to the first 
sum. 
From Theorem 5.10, we deduce the following: 


291 


Le,(s) = |] (1 — mq f*) ; 


j=l 


where each 7; has absolute value q/‘/? (it is here that the Riemann hy- 
pothesis for function fields is used). Assuming that s is real and bigger than 
1, we obtain the following inequalities: 


29 29 
(1-q°*") ' < Lp,(s)< < (1 +4 “7 ) . 


By Proposition 10.4 and the remark following it, we see that there is a 
constant r, independent of J, such that 2g, < rl for all /. Substituting this 
into the last equation and taking logarithms of the terms of the resulting 
inequalities yields 


rllog (1 -q?) < log Lz, (s) < rllog (A+q°* ) ; (5) 


If 0 < a, then log(1 +2) < xz, so 


log Lz, (s) < rlq7 i 


To deal with the left-hand side of Equation 5, note that for 0 < 2 <1 


—log(1—a) = Leche 


k=1 


Substitute z = q~f\)/ into this and also use the fact that (1—q7/(9/?)-! 
(1 —q71/?)-1!, We obtain 


q  _f£0 
eat 2 <logLg,(s) . 


Altogether, we have established that 


—rl 


|log Lz, (s)| < rl 


Jq—1 


Since log ¢x(s) + 00 as s > 17, we see that 1/| log ¢x(s)| is bounded by 
some constant, say, C’, on the interval (1,s,). Thus, for s € (1, s,) we have 


1 |logLa(s)) 4 VF yw] 
> 0 IFO Mostk(s)) <"° a1 d Fda 
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We will have established that the first sum on the right-hand side of 
Equation 3 is uniformly convergent on the interval (1,51) once we prove 
the following lemma. This will also finish the proof of Lemma 10.12. 


Lemma 10.13. The sum , 


lAp fq" 
1s convergent. 


Proof. We will break the sum up into two subsums, the first over all primes 
l such that | < qf/2 and the second over all primes such that | > q/ (1)/2 
For the first subsum we have 


1 1 
nn —_— 
» fq — D fy 


lAp 
l<qftt)/2 


The latter sum converges by Romanoff’s Theorem. 

To analyze the second subsum, we first try to figure out how many primes 
! there are such that f(l) takes on a fixed value f. Such primes must divide 
gf —1. Let ly, lo,--- ,14, be the set of such primes. We have 


Iylo---le, | qi - 


We are now considering primes / such that gf/? < 1, so it follows that 


which is a convergent series. In fact, its sum is —log(1 — q72). 
Since both subsums converge, the proof is complete. 


The two remaining things which need proof are the results of Romanoff 
and Heilbronn. Both these proofs belong to elementary number theory 
and not to the arithmetic of function fields. Nevertheless, for the sake of 
completeness we will sketch these proofs. 

The proof of Romanoff’s Theorem uses the following lemma. 


Lemma 10.14. Let y denote Euler’s constant. 


yo he < “567 loglog(n) + O(1) . 
d\n 
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Proof. First of all, note that 


pd)" 1 

dy - = T+ Db) 

d|n p|n 
The product is over all prime divisors p of n. We break this product up 
into two parts - first the product over prime divisors > log(n) and secondly 
over prime divisors < log(n). 

If p1,p2,°** ,Pg(n) are the prime divisors of n which are greater than 
log(n) we find, using [| p; < n, that 
log(n)9™ <n, 


and, consequently, g(n) < log(n)/loglog(n). It follows that 


MH (+2)<(+gis) = 1+ (gpa) 


pin 
log(n) <p 


We now consider the product over prime divisors less than log(n). 


Uo (-s)= a (-3) mw . 


p|n |n pin 
p<log(n) p<log(n) p<log(n) 


The first product on the right-hand side is 1/¢(2) + O(1/log(n)) = 6/m? + 
O(1/log(n)). The second product is < e7loglog(n) + O(1) by Merten’s 
Theorem (see Hardy and Wright [1], Theorem 429). 

Putting all these estimates together gives the result. 


We now have everything we need to give the proof of Theorem 10.8. 
We begin by rewriting the sum in question as follows (all sums are over 
square-free m prime to q): 


where d(n) = Dim, f(m)an L/m. 
Define D(n) = S7y_, d(k). A moment’s reflection shows that 


1 
m,f(m)<n 


Any m entering the definition of D(n) must be a square-free divisor of 
A(n) = (q” — 1)(q"~1 — 1)-++(q—1). Clearly, A(n) < gq”. Thus, using 
Lemma 10.14 we find 


)< S- pm)” <<" log log A(n) < M log(n) , 
m|A(n) 
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for an appropriate positive constant M. 
Let’s define D(0) = 0 and calculate 


‘d(n) < (n —1) D(n) — D(N) 
yo eee lane)? 


n=l n=l 


Since D(n) < M log(n), we see that for large N the right-hand side of 
this equation is less than 


_ _log(n)_ 
1+M ) am+l) me) 


This is finite, and this implies }>~, d(n)/n converges. The proof is com- 
plete. 


This simple proof is due to M. Ram Murty. The same argument can be 
made to give much stronger and more general results of the same nature. 
For these improvements and generalizations see Murty-Rosen-Silverman 
[1]. 

Our final task in his chapter is to prove the inequality due to Heilbronn, 
Proposition 10.10. This will be seen to follow from a more general inequality 
belonging to the elementary theory of numbers (see Heilbronn [1]). 

For any subset S C Z*, the positive integers, we define its natural density 
to be 


d(S) = Jim X~'#{n eS |n< X}, 


provided that the limit exists. 

If S has a natural density, then it is not too hard to show it has a Dirichlet 
density as well, and that the two are equal. On the other hand, there exist 
sets with Dirichlet density, but not natural density. 

Natural density has a number of simple and easily proven properties. 


1. d(S) =Oif S is a finite set. 

2. d(Zt)=1, where Z* is the set of all positive integers. 

3. If S, C So, then d(S;) < d(S2), provided that both densities exist. 

4. If S,;NS2 = ¢, then d($, U Sg) = d(S,) + d(S2), provided that d(S}) 
and d(S2) both exist. 

5. Let h be a positive integer and define hS = {hs | s € S}. Then, 
d(hS) = 7d(S), provided that d(S) exists. 


It follows that the natural density of the set of integers divisible by a 
positive integer a is exactly 1/a and, consequently, the natural density of 
the set of integers not divisible by a is 1 — 1/a. Let {a1,a2,...,an} bea 
finite set of positive integers and T,, the set of positive integers not divisible 
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by any a; with 1 <i <n. It is an exercise to show, using the inclusion- 
exclusion principle, that 7, has a natural density given by 


WB) =1- D+ gage tl meal 


4 l<icji<n aa] a; 


In this equation, the square brackets denote least common multiple. 

Heilbronn gives the following lower bound for this density. It is worth 
pointing out that the inequality becomes an equality if the integers a; are 
pairwise relatively prime. 


Lemma 10.15. 


Proof. The proof is by induction on n. If n = 1 the assertion reads d(T,) = 
1 —1/a; which we have already noted. So, we assume the result is true for 
nm and prove it for n+ 1. 

Note that T,, = T,41US where S is the set of positive integers divisible by 
Qn+1 and by none of the integers a; with 1 <i <n. This is a disjoint union. 
A moments reflection shows that S C ani17T,. Thus, d(S) < —+-d(T,). 


—~ Qn+1 


Consequently, 


1 


Qn+1 


0-5) 2 (0-3) 


The last inequality follows from the induction assumption. 


d(Tn+1) = d(T) _ d(S) = d(T) -_ d(T) 


To go from this elementary lemma to the inequality of Proposition 10.10 
we need to first give a mild generalization which is proven, as we shall see, 
by an amusing geometric argument. 


Lemma 10.16. Let x1, 29,...,2%n, be real numbers with 0 < x; <1 for each 
1. As above, let {a1,a2,...,Qn} be a finite set of positive integers. Then 
nr 
Li LiL; x 
B+ SD sah cores eH (1-2), 
Qj; l<icj<n lax, a5 mann i=l 


Proof. (Sketch) Let F'(z1,22,...,2,) denote the difference between the 
left-hand side and the right-hand side of the inequality given in the state- 
ment of the Lemma. We think of F' as a function on the unit cube and 
prove that it is non-negative on this domain. That will prove the lemma. 
If we fix the values of all the variables except x; the resulting function is 
an inhomogeneous linear function of x,. Consequently, on the interval [0, 1] 
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it takes its minimum value at one of the endpoints. Using this fact, and a 
simple induction on n, we find that on the unit cube, F’ takes its minimum 
at a vertex. A vertex has coordinates (€1, €2,...,€n), Where each e¢; is either 
0 or 1. At such a point, the value of F' is non-negative by Lemma 10.15. 
Thus, F’ is non-negative on the unit cube. This completes the proof. 


We can now prove Proposition 10.10. Let 1) < lg < lj < --+ be an 
enumeration of the positive prime numbers different from p. Recall that 
f(m) is the order of gq modulo m and that, for square-free m, f(m) is the 
least common multiple of {f(l) | ljm} 

Define m,, = l,lo:--l,. Then 


p(d) ~ I ft 
x df(d) ~ 2. lif (li) + 2d lils[F (li), Fs) 


cee lyla+++ln[f (li), f(t), + »f(n)| ) 


By Lemma 10.16, setting a; = f(l;) and x; = 1/l;, we obtain 


© #521 6-rn) =D C-an). 


[mn l~Ap 


Using Romanoff’s theorem one more time, it is easy to see that the left- 
hand side of this inequality tends to }7,, (m,g)=1 H(m)/mf(m) as n tends 
to oo. 

We have now proven Theorem 10.10 in the case where a is not an /-th 
power for any prime | ¥ p. In the general case one proceeds the same way 
as above except that one sets x; = 1/1, if a is not an /,;-th power and x; = 1 
if a is an /;-th power. We leave it to the reader to check that this procedure 
leads to the correct result. 


Exercises 


1. Let K/F be a function field and suppose K contains a primitive /-th 
root of unity, where / is a prime unequal to the characteristic of K. 
If a € K* is not geometric at |, show there is a uw € F* andabe K* 
such that a = pb’. 


2. In the course of the proof of Proposition 10.5 the /-th power residue 
symbol, (a/P);, was defined. Show that it is a good generalization 
of the Legendre symbol of elementary number theory, by proving 
that it has the following three properties: (i) (a/P); = (b/P), if 

= b (mod P), (ii) (ab/P); = (a/P),(b/P), and X' = a (mod P) 
is solvable (for a 4 0) if and only if (a/P); =1. 


10. 


11. 
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. Prove that [F(¢m) : F] = f(m) is the order of gq modulo m, provided 


that (q,m) =1. 


. Prove Property 5 of natural density. Namely, if S C Z* and he Zt, 


then 6(hS) = h~+6(S) provided that either density exists. 


. In Lemma 10.5, show the inequality is an equality if the a; are pairwise 


coprime. What happens if all the a; are all equal? 


. Suppose K’/F is a function field of characteristic p. If a € K* is not 


an l-th power for any prime I, then it follows from the text that the 
Dirichlet density of the set of primes for which a is a primitive root 
exceeds ]]),,(1 —/~*f(1)~*) = ep, which does not depend on a. If 
p > 2 show c, < .5.Is co > .5 ? (Recall that f(/) is the order of q 
modulo !, where g = |F|.) 


. Let K/F be a function field and suppose | |g — 1 where q = |F|. Ifa 


and b are geometric at 1, show the constant field of K (,/a, v/b) is F 
unless ab’ € F*K* for some i with 1 <i <I. 


. Let K/F be a function field and let | be a prime different from 


the characteristic. Two elements of K*, a and b, are said to be I- 
independent if for all integers m and n, ab" € F*K *! if and only if 
llm and lin. Assume a,b € K* are geometric at | and l-independent. 
Define K, = K(¢,,.Wa, Wb). Prove that [K,: K] = f(DI?. 


. With the same notation as the previous problem, suppose a,b € K* 


are l-independent for all primes | different from the characteristic of 
K. Show there is a constant c depending only on a and b such that 
the genus of K; is bounded by cl? for all I. 


(Continuation) Let a,b € K* be geometric and [-independent for all 
primes / different from p, the characteristic of K. Define Ma to be 
the set of primes P of K such that (Op/P)* is generated by the 
residues of a and b modulo P. Calculate 6(M.,,,). Use Heilbronn’s 
Theorem to show that 6(M,.) > 6/7?(1 — p~*)~*. (Hint: Imitate 
the proof of Bilharz’s Theorem. Because many of the sums involved 
converge for trivial reasons, one need not use Romanoff’s Theorem.) 


The reader may wonder if there are elements which satisfy the hy- 
potheses of the previous problem. In fact, they exist in abundance. 
Suppose S is a finite set of primes with more than three elements. Let 
E(S) = {a € K* | ordp(a) = 0, VP ¢ S}, the group of S-units of A™. 
In Chapter 14 we will show that E(S)/F* is a free group on |S|— 1 
generators. Let a,b € E(S) map onto elements of a basis of E(S)/F*. 
Show that a and b are geometric and that they are /-independent for 
all primes J. 


11 


The Behavior of the Class Group in 
Constant Field Extensions 


In Chapter 8, we discussed constant field extensions and, toward the end 
of the chapter, we gave particular attention to the case when the base field 
has a finite field of constants. We begin by recalling some notation. 

Let K be an algebraic function field over a finite field F with q elements. 
Fix an algebraic closure F of F, and let F, be the unique subfield of F 
such that [F, : F] = n. Let K, = KF, be the constant field extension of 
K by F,, and h(K,,) the class number of K,,. By definition, h(K,,) is the 
number of elements in the group of divisor classes of degree zero of the field 
K,. Formerly we denoted this group by Cl% . We simplify the notation 
by writing Cl = J(F,). The choice of the letter “J” is not completely 
arbitrary. If one regards K as the field of rational functions on a smooth 
curve defined over F, then J is closely related to the Jacobian variety of 
that curve. 

To begin with we will discuss the question of how the class numbers 
h(K,,) vary with n. The main tool will be the formula 


29 


h( Ky) = [[¢ — 7) , 


1=1 


given in the corollary to Proposition 8.16. We will also use some elementary 
facts from algebraic number theory and the theory of /-adic numbers. 
Afterwards we will deal with the more difficult question of how the finite 
abelian groups J(F,,) vary with n. Once again we will need to deal with 
some elementary results from the theory of /-adic numbers. We will also 
use some results from the theory of cohomology of groups. Readers who 
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lack this background may want to just skim these proofs and move on. 
Finally, we will need some basic facts which can be stated entirely in the 
language of function fields, but whose proof requires a substantial amount 
of algebraic geometry. We will state these facts as clearly as possible and 
use them to derive a number of interesting results on the behavior of the 
class group under constant field extension. For the proof of the basic facts 
we will have to refer the reader to other sources. 

The chapter will conclude with a brief discussion of how the material 
presented in this chapter was imported into algebraic number theory by K. 
Iwasawa. The resulting theory has been a fundamental tool in many of the 
most important developments in number theory over the past 50 years. 


Let wu = q~*. Then the zeta function of K written as a function of u has 
the form 
Lx(u) 


(1—u)(1— qu) © 

where Lx (u) is a polynomial of degree twice the genus with integer coef- 
ficients and constant term 1. If we factor Dx (u) over the algebraic closure 
of Q, we obtain 


ZK(u) = 


29 
Lx(u) = [ [GQ — 7x), 
i=1 
where the 7; are algebraic integers. It was pointed out after Theorem 5.9 
that the functional equation for Z%(u) is equivalent to 7 — q/7m being a 
permutation of the roots of Lx(u). Write Dx(u) = an a,u*. It is then 
easy to see that another way of stating the functional equation is 


gar =dg-k for O0O<k<qg. 


We will now use these facts to prove several things about how class numbers 
behave in constant field extensions. 


Proposition 11.1. Let 1 be a prime which divides h(K). If 1 | ni}, 
then 1 |h(K,)/h(K). 
Proof. Let & be the number field obtain by adjoining all the elements 7, 


to the rational numbers Q, and let L be a prime ideal of F lying over I. 


Then, J |h( A) implies 


29 


[[q —7;)=0 (mod £). 


i=1 
It follows that there is an i such that 7; = 1 (mod L). Let 7 be such that 
747; =q. Then, 7; =q (mod CL). Now write 


h(K,,)/h(K) = Tha + 1; +1? fo... +777") (mod £) . 


1=1 
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The i-th term is congruent to n and the j-th term is congruent to g”—1/q—1 
modulo £. The Proposition follows from this. 


Corollary 1. [fl |h(K) andl |n, then | |h(Ky)/h(K). 
Corollary 2. [fl |h(K) andn=l', then I’ |h(K,)/h(K). 
Proof. Just use Corollary 1 and induction on ¢. 


Corollary 3. /f the genus of K is 1 andl |h(K), then 1 |h(K,,)/h(K) tf 
and only if l |n(q” —1)/(q—1). 


Proposition 11.1 is about divisibility. The next result will be about non- 
divisibility. First, a definition. If | is a prime different from p, the charac- 
teristic of F, let d(l) be the least common multiple of the numbers 1” — 1 for 
1<k < 2g. Let d(p) be the least common multiple of the numbers p* ~ 1 
forl<k<4q. 


Lemma 11.2. As above, let L be a prime of E lying above |. If 7; € L, 
then ri) = 1 (mod £). 


Proof. Let Li, (u) = u29LK(1/u). Then, L%,(u) is a monic polynomial with 
integer coefficients whose roots are the numbers 7;. Suppose 7; ¢ £ and let 
7m; be the residue of 7; modulo £. Then, 7; is a root of Li-(u) modulo J, ice. 
of L%-(u) € Z/lZ/u]. Thus, it satisfies an irreducible polynomial over Z/IZ 
of degree k where 1 <k < 29, iflAp,and1<k<gifl =p. The last 
restriction holds because if | = p, the first g coefficients of L%,-(u) are zero 
(recall that g9~*ay, = dog_4, for 0 < k < g). Since [Z/IZ[m;] : Z/IZ] = k, 
7; is a non-zero element of a finite field with /* elements. This implies 
qed. 1, which in turn implies that nit = 1 (mod £), and so ri) = 
(mod £), as asserted. 


Since for every 7; there is a 7; with 7:7; = q, it follows that every prime 
dividing 7; in FE must lie above p. Thus, the hypothesis 7; ¢ £ is only 
necessary when | = p. 


Proposition 11.3. Suppose | does not divide h(K) and that n is relatively 
prime to d(l). Then, | does not divide h(K,). 


Proof. Suppose | |h(K,,). Then, for some k, 7? — 1 =0 (mod £). By the 
above lemma, ri) = 1 (mod £). Since (n, d(l)) = 1 it follows that 7, = 1 
(mod £), but this implies / |h(A’) contrary to the hypothesis. Thus, / does 
not divide h(K,,). 


Corollary. [fn = l' andl does not divide h(K), then | does not divide 
h(K,). 


Proof. From the definition of d(l), it is clear that | does not divide d(l) 
and so (I’, d(l)) = 1. Now apply the proposition. 
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Proposition 11.4. Let n be he smallest integer such that | |h(K,,). Then, 
n |d(l). 


Proof. There must be an index k such that m? = 1 (mod CL). From the 
definition of n it follows that n is the order of 7, modulo £L. By Lemma 


11.2, re) = 1 (mod £). Thus, n |d(l), as asserted. 


We are next going to consider the behavior of the class group in the 
l-tower. By this we mean the ascending sequence of fields K C K; C Kj2 C 
+++ C Kin Cc... . To begin with we will be interested in the l-primary 
part of the class groups. We denote these by J(F)~)(l). The orders of these 
groups are given by 


[“- where e, = ord; h(Kin) . 


Of course, the numbers e, depend on the prime |, but we have decided not 
to complicate the notation overmuch by making this dependence explicit. 
The main result is as follows. 


Theorem 11.5. There are constants \,, 4, and a positive integer no such 
that for alln > no, €n = A N+; ue., the numbers en, grow linearly 
with n. 


Proof. We will again make use of the formula h(K,) = Ih 1-77), but 
it will be convenient to reformulate it as an /-adic formula. Consider Q as 
a subset of its /-adic completion Q,;. Let Q and Q; be the algebraic closures 
of Q and Q,, respectively. Finally, let p be an embedding of Q into Q,. 
Applying p to the above formula yields h(K,) = [][-2,(1 — o(m)"). Having 
done this, we now simply rename p(7;) as 7; and assume that our original 
formula takes place inside Q,. 

In the usual way, the additive valuation ord; from Q; to ZU oo extends 
to an additive valuation from Q; to QU oo. Namely, if a € Q,, let a be the 
norm of a from Q;(a) down to Q;. Then, ord;(a) = [Q;(a@) : Q;]~tord;(a). 
If ¢ € Q is any primitive J”-th root of unity, it is a standard fact that 
ord;(¢—1) = 1/¢(/") = 1/1"—! (1-1). Thus, as n increases, these valuations 
tend to zero. 

Let 7 € Q be integral over Z;. If ord)(m—1) = 0, we claim that ord; (1 — 
1) = 0 for all positive integers n. To see this, note 


[™—] i~—1 
ordj(m —1) = S- ord)(m — ¢;) = ord;(m — 1) + S- ord;(m — ¢;) . 
j=0 j=l 


Here, the ¢; run through all the /"-th roots of unity and ¢) = 1. We 
are assuming ord;(7 — 1) = 0. It follows that for 7 > 0, ordj(m —¢;) = 
ord)(7—1+1 -—(,) = ord;(#—1) =0. The next to the last equality follows 
since ord;(¢; — 1) > 0 = ord;(a — 1). Thus, all the terms on the right-hand 
side of the equation displayed above are zero and we have proven our claim. 
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Now assume that ord;(a — 1) > 0. Consider all /”-th roots of unity for 
all n. Among all these we claim ord;(7 — ¢) = ord;(1 —¢) with only finitely 
many exceptions. This is because 


ord;(7 — ¢) = ord;(a —~1+1-—¢) =ord)(1—¢) , 


as soon as ¢ is a primitive /”-th root of unity with n so large that ord;(1 — 
¢) = 1/(I") < ord;(a—1). Let S be the set of J-power roots of unity where 
this fails to happen. Choosing n sufficiently large we find 


[~—1 [™—1 
ord;(7 —1) = S- ordj(m—¢;) = S- ord;(m—¢;) + S- ord)(m—C;) . 
j=0 ¢;ES g=1, 6, €S 


In the last sum all the terms can be replaced by ord;(1 — ¢;). Adding in 
and simultaneously subtracting the remaining terms yields 


[7] 
S (ordi(m — ¢j) — ordy(1 — ¢3)) + S © ordi(1 = G5) . 


Call the first sum c. The second sum is actually equal to n as we see by 
differentiating the equation 2 — 1 = [jo (2 —¢;), setting x = 1 in the 
result, and then taking ord; of both sides. 

Summarizing, we have shown that if ord;(7 — 1) > 0, then for all suffi- 
ciently large n there is a constant c such that ord;(x’ — 1) = n+ ce. Of 
course, c depends on 7. 

To return to our original situation, among the numbers {7; | 1 <1 < 2g} 
label them in such a way that for 1 <i < X; we have ord;(z; — 1) > 0 
and for A; < i < 2g we have ord;(7; — 1) = 0. For the first range, let c; be 
the constant associated to 7, by the above considerations. Then, for all n 


sufficiently large, 
29 
én = ord)(h(Kin)) = S— ordi(1 — 7)" ) = 
i=l 


XI 


AI 
S— ordi(1 —r) = Si(n +o)=Apn+M, 
i=] 


i=1 


where 1, = yt , ci. The proof is complete! 


Let l’ run through the primes other than /. What can be said about 
the behavior of the l’-primary components of the class group of Kjn? Sur- 
prisingly, these behave in an entirely different manner than the l-primary 
component. 
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Theorem 11.6. The numbers ord: (h(Kin)) are bounded from above. In 
other words, they increase up to some point no and then stay the same for 
alln > no. 


Proof. Once again, we start from the formula h(K,) = 124.1 — 17") 
taking place in Q, but now we reinterpret it to hold in Q, (see the first 
paragraph in the proof of Theorem 11.5). 

If ¢ £1 is an /”-th root of unity, then ord) (¢ —1) = 0. This follows from 


the identity 
"1 
0 = ord) (1") = S> ordy (1 — Gj) , 
j=l 
where the sum is over all /"-th roots of unity except 1. 

Suppose 7 € Q, is integral over Zp and ordy (a — 1) > 0. If ¢ # 1 is 
any /”-th root of unity, we must have ordy (a — ¢) = ordy (a —-1+1-—(¢) = 
ordy (1 —¢) = 0. Thus, ordy (2 —1) = Yeo ordy (a — ¢;) = ordy (a —1). 

Now, suppose ord) (7 — 1) = 0. There are two possibilities. Either 
ord; (x — 1) = 0 for all n, or there is a t such that ordy (rb —1)>0.In 
the latter case we must have ord) (1! — 1) = ordy- (a —1) for all n > t by 
the considerations of the last paragraph. 

For the set of elements {7; | 1 <2 < 2g} label the indices in such a way 
that for each 7 with 1 <i <d there is at; > 0 such that ordy (a! —1)>0 
and for i > d, ordy (x — 1) = 0 for all n > 0. Then, if n is bigger than 
max)<;<a(t;) we have 


2g d 
ordy (h( Kin)) = S/ordy (rt —i)= S- ordy (15 — 1) ; 
t=1 i=1 


Since this last sum does not depend on n, the result follows. 


Corollary. Let S(I") be the set of primes which divide h( Ki). Then, 
#S(I") — co as n - 00. 


Proof. By Proposition 5.11, there is a constant C such that h( Kin) > 
Cq?'". If S(I) remained constant from some point on, then combining 
Theorems 11.5 and 11.6, it would follow that h(/jn) would be equal to a 
constant times 1*’” for large n. Clearly, this is incompatible with the lower 
bound for the growth of h( Kj) just given. 


Up to now, we have not paid too much attention to the fact that the 
prime p, the characteristic of F, behaves differently from the other primes. 
Let’s pay a little more attention to this now. 

As before, let F be the field obtained by adjoining all the elements 7; to 
the rational numbers Q. Let P be a prime in EF lying above p. Given an 
index 7 let 7 be determined by 7,7; = q. Then either 7; or 7; or both must 
lie in P. It follows that at most g of the 7; do not belong to P. Let A be 
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the exact number that do not belong to P. We have 0 < ’ < g. Relabel 
the indices, if necessary, so that 7; ¢ P for 0 <i < A and 7; € P for 
A<1a< 2g. 


Lemma 11.7. Let Lx(u) be the reduction of Lx (u) modulo p. Then is 
equal to the degree of Lx(u). 


Proof. Lx (u) = \~*9_, amu” = [2 (1— au). Thus, each a is the m-th 
elementary symmetric function of the 7;. If m > A, each term of the m-th 
elementary symmetric function contains a factor 7; with j in the range 
A<j < 2g. Thus, am € P, which implies pla,,. Now consider a), the A-th 
elementary symmetric function of the 7;. One of the terms is 71712...7), 
which is not in P. All the other terms are in P. Thus, a, ¢ P. This implies 
that p does not divide a), and so the degree of Lx (u) is A, as asserted. 


Later we will give a nice algebraic interpretation to the number 4, which 
is sometimes called the p-invariant of K. For the moment, we consider in 
more detail the case \ = 0, i.e., the situation when all the 7; are in P. By 
the lemma, this is equivalent to Dx(u)’s having degree zero. It also yields 
the following interesting congruences: 


h(Kn) =|[[Q-—7") =1 (mod P). 


i=1 


It follows from this that none of the class groups J(F,,) contain an element 
of order p. We make this property into a definition. 


Definition. Let A/F be a function field over a finite field of constants F. 
Let p be the characteristic of F. We say that K is super-singular if for all 
integers n > 0, p does not divide h(K,,). 


This definition can be given in even greater generality. Assume F' has 
characteristic p > 0 (but is not necessarily finite) and that K/F is a func- 
tion field with F' as its field of constants. One defines K to be super-singular 
if the class group of every constant field extension of K contains no element 
of order p. 

The next proposition gives several equivalent conditions for a global func- 
tion field to be supersingular. Before we state it, recall that for each integer 
m > 0, b,,(4) denotes the number of effective divisors of K of degree m. 


Proposition 11.8. The following conditions are equivalent: 
a) K is supersingular. 
b) All the 7; are in P. 


c) The degree of Lyx (u) is zero. 
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d) h(K,) =1 (mod P) for all n > 0. 
e) bm(K) =1 (mod p) forl<m<g. 


Proof. We have already seen that b implies a. To show the reverse, suppose 
that some 7; ¢ P. Then, for some n > 0 we have 7? = 1 (mod P). This 
implies p|h(K,) contrary to a. Thus, a implies b so these two conditions 
are equivalent. 

Lemma 11.7 shows the equivalence of 6 and c. 

We have already seen that b implies d. If d holds then a obviously follows, 
and we have shown d implies 6. Thus, the first four conditions are all 
equivalent. 

Recall that Zx%(u) = S>°°_ bn (K)u” (see Chapter 5). From consideration 
of the equation (1 — u)(1 — qu)ZxK(u) = LK(u), we obtain the following 
congruence: 


fore) 29 
(1 — u) S- bn(K)u" = S- Anu™ (mod p). 


It follows easily that bm(K) = S7y".) ax (mod p) for all m > 0 (we define 
Am = 0 if m > 2g). 

Now, assuming that c holds, it follows from the last paragraph that e 
is true. Assuming e, it follows again from the last paragraph, that plam 
for 1 < m < g. Since dog_m = q%@m for m in the same range, we have 
plam for 1 < m < 2g. This is the same as c. All equivalences have been 
demonstrated. 


Before giving two corollaries, we note that h(K) = b,() (mod p). This 
is because h(K) = Lx (1) = 3279_) am = S29, Gm = bg(K) (mod p) by 


m=0 
what has been proven above. 


Corollary 1. Suppose K has genus 1. Then, K is supersingular if and 
only if h(K) =1 (mod p). 


Corollary 2. Suppose K has genus 2. Then, K is supersingular if and 
only if the number of primes of degree 1 and the class number, h(K’), are 
both congruent to 1 modulo p. 


Both corollaries are consequences of condition e and the above remark. 


We are now going to consider not just the size of the class groups J(F,), 
i.e., the class numbers h(K,,), but the actual structure of these groups. 
To do this it will be necessary to use more algebraic tools than we have 
previously, and also, as already has been said, a number of fairly difficult 
results whose only known proofs require a substantial amount of algebraic 
geometry. So we will only sketch these developments and not attempt to 
provide proofs for everything. Nevertheless, we hope to tell a coherent and 
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interesting mathematical story with enough detail so that the interested 
reader can, perhaps with some additional work, fill in the gaps. 

We begin with some generalities about constant field extensions. For a 
while we only require that the constant field F' of the function field K 
be a perfect field. With this assumption, we can use all the results about 
constant field extensions proven in Chapter 8. 

The first result we will need is even more general. 


Proposition 11.9. Let L be a finite, unramified, Galois extension of the 
function field K. Let G = Gal(L/K). Then, DE = injKDxK. Here, Dx and 
Dz denote the divisor groups of K and L, respectively, and DE denotes the 
divisors of L left fixed by the elements of G. 


Proof. If P is a prime of K, then, by definition, iz;~%.P = DU BIP e(B/P)P. 
Since we are assuming that L/K is unramified, this becomes i; JKP = 
Dp p ®. 

Suppose D = ) 1, a()P is a divisor of L fixed by all the elements o € G. 
Applying o to both sides, using oD = D, and equating coefficients, yields 
the result that a(o) = a(%) for all primes % of L. By Proposition 9.2, 
G acts transitively on the set of primes of L lying over a fixed prime P 
of K. It follows that D is a Z-linear combination of divisors of the form 
ti/KP. This shows that D € i,;~Dx. Thus, DEC injK Dx. The converse 
is obvious. 


From the definition, it is clear that iz/~% gives a one-to-one homomor- 
phism from Dx to D,. By the corollary to Proposition 7.8, 1;/x induces a 
homomorphism from Clx to Cl,. On this level it need not be one to one 
in general. However, in the case of separable constant field extensions it is 
one to one. Even more is true when the constant field extension is Galois. 


Proposition 11.10. Let K be a function field with constant field F. Let E 
be a finite, Galois extension of F and set L = KE. Let G = Gal(E/F) = 
Gal(L/K). Then itsK : Cle — Cly is one to one, and CIE = tr/KClK. 


Proof. The proof will use some facts from cohomology of groups. 
Let Pz; and Px be the principal divisors of L and K, respectively. Con- 
sider the exact sequence: 


(0) > E* > L* > Py > (0). 


Passing to the long exact sequence and using the fact that H1(G, E*) = (0) 
(Hilbert’s Theorem 90), we find that the following sequence is exact 


(0) > F* + K* = PS - (0). 


This shows that ip;«PK = PF. 
Using the same associated long exact sequence, but starting at the term 
H1'(G, L*), which is zero, again by Hilbert’s Theorem 90, we find 


(0) + H1(G,P,) + H?(G, E*) > H?(G,L") 
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is exact. The third arrow can be shown to be one to one by a localiza- 
tion argument. It follows that H!(G,Pz,) = (0). In the case of finite con- 
stant fields, which is our principal interest, the result is even easier. In this 
case H?(G, E*) is isomorphic to F*/Nz >E* since G is cyclic. For finite 
fields the norm map is onto. Thus, in this case, H?(G, E*) = (0) and so 
H'(G,P_) = (0) as well. 


Next, consider a second exact sequence: 
(0) > Pr, —> Dy > Cl, > (0). 
Passing to the associated long exact sequence, we find 
(0) + PE + Df > Cl > H'(G,Pz) 


is exact. By what we have already proven, the first term is iz/%Px, the 
second is iz/~%Dx (by Proposition 11.9, since constant field extensions are 
unramified), and the fourth term is zero. Thus, 


(0) > Px > Dx > CI? > (0) 


is exact if we define the third arrow to be the map iz/x. This is equivalent 
to the assertions of the proposition, so the proof is complete. 


Corollary. With the same hypotheses as the proposition, in;% restricted 
to lf, is one to one and iz; KCI = (Cl¥¢ )e. 


Proof. The restriction of a one-to-one map is still one to one, so the first 
assertion is obvious. 

To prove the second assertion, suppose D € (Cl? )°. By the proposition, 
there is a class D € Clx such that i, jKD = = ©. By Proposition 7.7, 
deg, injKD = degy D. Since deg, D = 0, it follows that deg, D = 0, 
which completes the proof. 


We now attempt to package all the class groups of constant field ex- 
tensions of K into one big group. To do this we will need the notion of an 
infinite Galois extension of fields and the associated Galois group. Let L/K 
be an algebraic extension of fields, of finite or infinite degree. Let Aut(L/K) 
be the group of all field automorphisms of L which leave K fixed. If K is 
the fixed field of Aut(L/K), we say that L/K is a Galois extension and 
define Gal(L/K) = Aut(L/K) to be its Galois group. It is easy to check 
that L/K is a Galois extension if and only if it is the union of all finite 
Galois extensions of K contained in L. The fundamental theorem of Galois 
theory relating subgroups of Gal(Z/K) with intermediate fields does not 
hold in the general case, but it can be reestablished by defining a topology 
on the Galois group. One proclaims a neighborhood basis for the identity 
element to be the set of subgroups of finite index. This leads to a unique 
topology on the Galois group which makes it into a topological group. This 
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topology is called the Krull topology. It then turns out that the usual pro- 
cedure now yields a one to one correspondence between closed subgroups 
of the Galois group and intermediate fields. All the standard properties of 
this correspondence continue to hold. One simply has to be careful that all 
subgroups under consideration are closed. 

We continue to assume that F is perfect. Let F’ denote an algebraic 
closure of F. Then, one sees easily that F’/F is an infinite Galois extension. 
We set Gr = Gal(F'/F). If F C E C F is an intermediate field we easily 
see that F’/E is a Galois extension and we write Gg = Gal(F'/E). Gz isa 
closed subgroup of Gp. It is a normal subgroup if and only if & is a Galois 
extension of F’. In this case Gal(E/F) = Gr/Ge. 

We now return to the situation where K is a function field over a perfect 
constant field F’. For every field E between F and F’, we define J(E) to be 
the group Cl? 7. Proposition 11.10 and its corollary which we proved for 
finite constant field extensions continue to hold for infinite constant field 
extensions. Thus, there are one-to-one maps from all the groups J(E) into 
J(F) given by extension of divisors. We will identify J(E) with its image in 
J(F), In this way all the groups J(E) are subgroups of J(f) and one can use 
the corollary to Proposition 11.10 to show that J(F)°* = J(E). Of course, 
the same corollary shows that if E/F is Galois, then J(E)S#(4/") = J(F). 

Another useful property is that J(F) is the union of the groups {J(E) | [E : 
F’ < co}. The idea of the proof is to use Propositions 8.10 and 8.11 to show 
that every prime %B of K F' is the extension of a prime coming from a finite 
level. If P is a prime of K lying below , then Proposition 8.11 shows there 
is a finite extension F’/F' such that P splits into a product of primes of de- 
gree 1 in KE. Let p be the prime in K E lying below $8. Then Proposition 
8.10 can be used to show that $8 is the extension of p to KF. 

It will simplify the notation to simply call J(F’) = J. The advantage 
of considering the group J is that algebraic geometry gives a method for 
determining a great deal about its structure. We then attempt to use this 
information to investigate the structure of the groups J(£) where E is a 
finite extension of F’, 


Theorem 11.12. Let K/M be a function field over an algebraically closed 
field of constants M. Set J = Cl%,. Then, J is a divisible group (for all 
integers n, the map x — nx is onto). Denote by g the genus of K. If is a 
prime different from the characteristic of M, then the l-primary subgroup 
of J, J(l), is the direct sum of 2g copies of Q;/Z,. If the characteristic 
of M is p > 0, then J(p) is the direct sum of y copies of Q,/Zp where 
OSYSQ. 


In the case where M = C, the complex numbers, it is a classical theorem 
(Abel-Jacobi) that J is isomorphic to the direct sum of g copies of C modulo 
a lattice A of maximal rank, i.e., J & C9/A. A must be a free Z module of 
rank 2g. It follows that the elements of J of order dividing /” constitute a 
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group isomorphic to -A/A = ae 7Z/Z. Passing to the limit as n — oo 
gives the theorem. 

The first proof of the theorem in the abstract case was due to Weil [2]. 
The reader can also consult Mumford [1]. 

The group Q;/Z; has a very simple structure. It is the union of the 
subgroups LI /Z, which are isomorphic to Z,/l"Z, = Z/l"Z, a cyclic group 
of order /”. Thus, there is a one-to-one correspondence between proper 
subgroups of Q,/Z; and non-negative powers of |. This allows us to deduce 
a simple corollary from the theorem. 


Corollary. Define J|N] to be the subgroup of J consisting of elements 
whose order divides N. If N is not divisible by the characteristic of M, 
then 


29 
JIN] = PBZ/NZ. 


Proof. By the Chinese Remainder Theorem, it is enough to check the result 
when N = /” is a power of a prime different from the characteristic. In this 
case the corollary is immediate from the theorem and the above remark. 


Let Y Cc J be a finite subgroup of J which is invariant under Gf, i.e., 
oy € Y for allo € Gp and all y € Y. For example, Y = J[N] is such 
a subgroup since Ny = 0 implies 0 = o(Ny) = N(oy). It is immediate 
that each o € Gp induces a group automorphism of Y. Thus, we get a 
map Gp — Aut(Y). This is easily seen to be a group homomorphism. The 
kernel is a normal subgroup H of Gp of finite index. By definition, such 
a group is closed and thus it corresponds uniquely to its fixed field which 
we will call F(Y). Thus, H = Gprry). Since Gry) fixes Y we see that 
Y C JOrm = J(F(Y)). F(Y) is called the field of rationality of Y. It is 
the smallest extension E//F' such that Y C J(E). 

The exact sequence 


(0) > Gry) > Gr > Aut(Y), 


gives rise to a monomorphism from Gal(F(Y)/F’) = Gr/Gryy) — Aut(Y). 
Suppose JN is a positive integer prime to the characteristic of F. By the 
Corollary to Proposition 11.12 we have 


JN] & BD z/Nz | 
1 


It follows that Aut(J[N]) = GL2,(Z/NZ). Putting it all together we get a 


monomorphism 


pn : Gal(F(J[N])/F) 3 GLo,(Z/NZ) . (1) 
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This “Galois representation” plays a big role in the more advanced 
arithmetic theory of curves. Here we will leave off this general develop- 
ment and return to the special case where the constant field is finite. So, 
once again, let us suppose that F’ = F, a finite field with q elements. Let 
oq € Gp = Gal(F/F) be the automorphism defined by ¢,(a) = a4 for all 
a € F. For every positive integer m, let F,,, C F be the unique intermediate 
field with [F,, : F| = m. The restriction of ¢, to F,, generates the Galois 
group Gal(F,,,/F). The image of ¢, in this group has order m. It follows 
that the subgroup of Gr corresponding to the field F,, is the closure of the 
cyclic group < ¢;' > generated by oy in Gg. We note the fact that Gr is 


isomorphic to the inverse limit of the groups Z/mZ, i.e. the group Z,, the 
completion of Z with respect to all subgroups of finite index. 


Proposition 11.13. Let J[N] Cc J be the subgroup of J consisting of 
elements whose order divides N. Then, [F(J[N]) : F] is equal to the order 
of the matriz pn (dq) (see Equation 1 above). 


Proof. Since py is a monomorphism, the order of py(¢q) is the smallest 
power of @, which is the identity on F(J[N]). By the Galois theory of finite 
fields, this number is the dimension [F(J[N)]) : F). 


Proposition 11.14. For all but finitely many primes |, the dimension 
[F(J[2]) : F] is prime to l. 


Proof. We will need to use one of those basic facts about J whose proof 
involves a substantial amount of algebraic geometry. It concerns the char- 
acteristic polynomial of the matrix p;(¢,). We will assume | 4 p, the char- 
acteristic of F. 

Let Lx(u) be numerator of the zeta function Zx(u). Define L*(u) = 
u*9 Lx (1/u). L*(u) is a monic polynomial of degree 2g with coefficients in 
Z. Let D € Z denote the discriminant of this polynomial. The fact we need 
is that the characteristic polynomial of ;(¢,) is the reduction of L*(u) 
modulo J, This is implied by the characterization, due to Weil, of L*(u) 
as the “characteristic polynomial of Frobenius” acting on the Tate module 
at 1. See Milne [1], Mumford [1], or, for the original formulation, Weil [2]. 
It follows from this that if / does not divide D, then the characteristic 
polynomial of p;(¢,) does not have repeated factors in Z/IZ[u]. By linear 
algebra p(¢q) is diagonalizable over the algebraic closure of Z/IZ. The 
eigenvalues , which are non-zero since | #4 p, have order prime to J. It 
follows that the matrix p;(¢,) has order prime to |. The proposition is now 
a consequence of Proposition 11.13. 


Because of these propositions, among other reasons, it is of interest to 
investigate the structure of the matrix groups GL,(Z/NZ) where r and N 
are positive integers. We will sketch some of the main facts about them. 
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Suppose N = 1/15"... 1?" is the prime decomposition of N. Then using 
the Chinese Remainder Theorem, it can be seen that 


GL,(Z/NZ) &% GL,(Z/U™Z) x GL, (Z/U?Z) x ...GLp(Z/UZ) . 


This reduces the problem to the structure of the groups GL,(Z/I™Z), 
where / is a prime number. If m > 1, reduction modulo / gives rise to an 
exact sequence: 


(I) + 1 +1M,(Z/l"Z) + GL,(Z/I"Z) + GL,(Z/IZ) = (1) . 


Here, J denotes the identity r x r matrix and M,(Z/lZ) the ring of r xr 
matrices with coefficients in Z/1’Z. The order of the group [+/M,(Z/I™Z) 
is I” (™—1)_ Tt is an l-group whose structure can be investigated more closely. 


However, we will not enter into further details about this here. 
The size of the group GL,(Z/IZ) is given by 


IGL,(Z/IZ)| = (I —1)(l" —1)...(I" — 17") 
ner-Y/2qr _ qy(r-t —1)...(1—1). 


The proof of this is obtained by noticing that an r x r matrix with coeffi- 
cients in a field is invertible if and only if its rows are linearly independent. 
Thus, the first row must be non-zero. It can be any of /” — 1 row vectors. 
The second row must be linearly independent from the first, so it cannot 
be a multiple of the first row. Thus, there are /” — 1 choices for the second 
row. There are /? vectors in the linear span of the first two rows, so there 
are |” — |? choices for the third row. The general pattern is now clear. 
We summarize a portion of this discussion as follows. 


Proposition 11.15. The group GL,(Z/I™Z) has order equal to 
mm) qr — 4y(m-2 —1)...(0-1) 


where f(r,m) = (m—1)r? +r(r —1)/2. 


Note, in particular, that GL,(Z/I™Z) has a large I-Sylow subgroup whose 
order depends on m, whereas the “prime to 1” part of the group has order 
which is independent of m. This will be of importance later. 

We have developed most of the tools we will need for the next task, 
which is to give a structural, algebraic interpretation to Theorems 11.5 
and 11.6. To recall the situation, fix a prime / and consider the tower of 
fields K Cc Ki C K,2 C .... The two theorems in question were about the 
way the class numbers h(K;») behave as a function of n. We will now look 
at the more general question of how the groups J(F)») behave as a function 
of n. 

It will be convenient to define F;~ to be the union of the fields Fn where 
m varies over the positive integers. It is not hard to check that for every 


11. The Behavior of the Class Group in Constant Field Extensions 183 


finite extension E of Fj. we have [E : Fj] is prime to J. Thus, Fico is 
characterized as the maximal /-extension of F in F. Note that the group 
J(Fj-) is the union of the groups J(F/) over all n. 

We first consider the l-primary components. Recall 


I(Fi)(l) € J(! = Payn. (2) 
Proposition 11.16. We have the following group isomorphism: 


J (Fico )(1) = Baye. 


where r; is the dimension over Z/1Z of J[l]N J(Fi~). 


Proof. (Sketch) We first show that J(F). )(1) is a divisible group. Consider 
the exact sequence of Gr modules 


(0) + J[l] > J(l) > J(l) - (0) , 


where the third arrow is the map “multiplication by J.” This is onto since J 
is a divisible group and this easily implies that the /-primary component of 
J is divisible. Let H C Gp be the subgroup corresponding to Fj... Passing 
to the associated long exact sequence we find 


J(Fie)(1) + J(Fro)(1) > H'(H, J[l]) 


is exact. The first arrow is, again, multiplication by |. Every finite quotient 
of H is prime to J. It follows that H'!(H, J[l]) = (0). Thus, multiplication 
by | is onto and J(F).)(I) is divisible. 

From Equation 2 it now follows that 


J (F joo )( = Past ; 


where 1 < r; < 2g. If one considers the subgroups of both sides of this 
isomorphism consisting of the elements of order dividing | we obtain the 
characterization of r; given in the proposition. 


The following group theoretic lemma and its corollaries are the key to 
understanding the behavior of the groups J(Fi-)(I). 


Lemma 11.17. Suppose | is an odd prime, and that A is an abelian group 
isomorphic to G, Q:/Z. Let 6 : A + A be an endomorphism. Define 
Ao = {x € A| d(x) = x} and A, =: {x € A |d! (x) = x}. Suppose that Ao 
contains A|l] = {a € A |la=0}. Then, we have 


A,={xEA|lxre Ao}. 


[fl = 2, the same result holds provided we assume Ao contains A[4]. 
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Proof. We assume | is odd and begin by showing there is an endomorphism 
w of A such that 6 = [+ 1w where I is the identity endomorphism. 

Since A is divisible, given x € A, there is a y € A such that ly = 
xz. Set v(x) = d(y) — y. This is well defined, because if ly’ = z, then 
y—yé€ All. Since ¢ — I vanishes on All] by hypothesis, we must have 
d(y) —y = o(y’) — y’. It is easily verified that w is an endomorphism. 
Finally, “wl ) = o(ly) —ly = o(x) — 2, s0 6 =1+ ly as asserted. 

Thus, 


d=(1+ly/=I+ ({)w+ (9) Pv be T+ PU + ly) 


where pu is an endomorphism of A which commutes with w. 

The endomorphism J + ly is invertible because A being a torsion group 
implies that the formal inverse (I +1u)~! =I —lp+l*p? —... gives an 
actual inverse. 

We find that ¢'(x) = x if and only if l?W(1 4+ lu)(x) = 0 if and only 
if 1?¢b(r) = 0 (since w and 1+ lu commute). This last condition can be 
rewritten as ly(lx) = 0. Adding Iz to both sides, we see this condition is 
equivalent to lx + ly(lx) = lz, or, what is the same ¢(lx) = lx. We have 
shown that z is fixed by ¢! if and only if lz is fixed by ¢. This is equivalent 
to the statement of the lemma. 

If | = 2 and A[4] C Ao, the proof is entirely similar. We leave the details 
to the reader. 


Corollary 1. In addition to the hypotheses of the lemma, assume that Ag 
is finite. Then, 


r r 
Ap = hi“ Z,/Z, and A, & Bin’ 1Z,/Z, . 
i=1 = 
Proof. Since Al] C Ao and Ag is finite, it follows that Ao is the direct 
sum of r cyclic groups each of J-power order. This gives the first assertion. 
Rephrasing what has been shown so far, Ao has a set of generators 
{e1,€2,...,e-} such that yi=1 nje; = 0 if and only if 1”|n; for 1 <i <r. 
Since A is divisible, for each 7 there is an element e} € A with le} = e,. Let 
A‘, be the group generated by the set {et,e4,...,el}. Clearly, Ai © At. 
They both have the same number of elements since multiplication by | maps 
both sets onto Ao and the kernel in both cases is All]. Thus, Aj = Aj. It 
is now a simple matter to show }~,_, nie} = 0 if and only if P+ in, for 
1<i<pr. This is equivalent to the second isomorphism in the statement 
of the Corollary. 


Corollary 2. With the notation of Corollary 1, define An = {x € 
A | ¢' (x) = x}. Then, 


A, = Drv, /2, . 
1=1 
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Proof. The proof is by induction on n. Corollary 1 shows the result is true 
for n = 1. Now assume it is true for n — 1. Then apply Corollary 1 with Ao 
replaced by A,—1 and ¢ replaced by ¢'” . The result follows. 


We remark that the proofs may be given using properties of modules 
over Z,. The Pontyagin dual of A, A, is a free module of rank r over Z; and 
¢ induces an endomorphism of A. One can then prove the dual statements 
to those in the lemma and its corollaries by using properties of modules 
over Z,. Then dualizing once again we get what we want. It seemed more 
straightforward to work directly with A. 

The next result is, perhaps, the main result of this chapter. 


Theorem 11.18. Let K/F be a function field of genus g over a finite field 
F with q elements. Let J = Cl°(KF) and define r; to be the dimension 
over Z/IZ of Jl] N J(Fio). There is an integer no > 0 and integers Vy; with 
1<i<m7 such that for alln > no we have 


Tl 
J(Fire)(l) & Bie tro" Z/Zy . 
' | 


Proof. Define no by the equation Fyo A F(J[{l]) = Fino. It is not hard to 
see that every element in J{!]M J(F)) is rational over this field. Invoking 
Proposition 11.16, we find 


I(F ye )(1) & Da /Zi . 


Also, there exist integers v; for 1 < i < r; such that for each i, vy; > 0, 
and 


1 
J(Fino )(l) & GH“ Zy/Z, . 
1 
Now, define A = J(Fio)(!), Ao = J(Firo)(l), and ¢ = ¢f°. One sees 
that ¢ is the Frobenius automorphism for the extension F/F,, and that 


the triple A, Ao, and @ satisfy the hypotheses of Lemma 11.17. Invoking 
Corollary 2 to that lemma, we see that for all m > 0 we have 


rl 
J(Frorm)(l) & Gre Zi /Z 
1 


If we simply set n = no +m, the theorem follows. 
Corollary. For alln > no, we have 
ord; h( Kin) =rin+y , 


where 4 = ord; h( Kino) — nino. 
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Proof. We recall that h(Kj~) is the order of J(Fi-). From the isomorphism 
given in the theorem, we see 


rl 


ord; h( Kin) = So +n—no)=rynt+ Soi — no). 


i=1 i=1 


Define 1, = oie (Yi — ng) (and ignore the fact that the notation here is 
somewhat ambiguous). If we set n = no in the above formula, we get the 
characterization of 4 given in the corollary. 


The reader will not fail to notice that the corollary is a sharpened version 
of Proposition 11.5. The qualitative content is exactly the same, but now 
we have given group theoretic interpretations of the constants r; and 1. 
Moreover, even no is now made more precise. 

The situation becomes much simpler when Fj. NF(J(l}) = F. By Propo- 
sition 11.14, this happens for all but finitely many primes J. In this case, 
no = 0. Thus, r; is just the number of cyclic groups whose sum is J(F) (1) 
and the regular behavior of the / primary parts of the class group begins at 
the first step. Also, in this case 1 = ord; h(K). Everything is really simple! 

We can now give a group theoretic interpretation of Proposition 11.6 as 
well. For notational convenience we will use k for the second prime instead 


of Ll’. 


Theorem 11.19. Let! and k be primes with|#~k. Assume k # p. Define 


no as follows 
29 


no = ord, [[@ —1). 


i=] 
Then, J (F joo ) ‘a J(k) Cc J (Fino ). 
Proof. Let P € J(Fj.) have order k™. Then, P € J[k™| and so P is ratio- 


nal over both Fy. and F(J|k™]), and is thus rational over their intersection. 
By Equation 1 (just before Proposition 11.13) we have a monomorphism 


Gal(F(J[k™])/F) > GLog(Z/k™Z) . 
By Proposition 11.15, the later group has an order which is a power of k 
times []72 ,(k'—1). Thus, the highest power of | which divides [F(J[k™]) : F] 
is less than or equal to no. It follows that 
Fro ON EF(J[k™]) C Fino . 
This containment holds for all m > 0. Thus, 


Fj A F(J(k)) © Fyro , 


which is equivalent to the theorem. 
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It is clear that Theorem 11.19 implies Theorem 11.6 together with the 
improvement that we get an upper bound on no. It also gives some added 
insight into why the result is true. 

The restriction in the statement of the theorem that k # p is not essential. 
One simply has to modify the definition of no slightly and then the result 
holds when k = p as well. 

The final theorem of this chapter concerns the reduction of the polyno- 
mial Lx%(u) modulo p. In Proposition 11.7 we showed this reduced polyno- 
mial has degree \ where 0 < XA < g. In Proposition 11.8, we showed that 
K is supersingular if and only if 1 = 0. This can be restated as, J[p] = (0) 
if and only if A = 0. Using the tools developed to this point we can give a 
far-reaching generalization of this. 


Proposition 11.20. With previous notations, let y be the dimension over 
L/pL of J{p|. Then, y is equal to X, the degree of the polynomial Lx (u) 
reduced modulo p. 


Proof. Let Lx(u) = [72 ,(1 — mu) be the factorization of Lx(u) over 
the algebraic closure of the p-adic numbers. Let E be the subfield of Q, 
obtained by adjoining the elements 7, to Q,. Finally, let P be the maximal 
ideal of the integral closure of Z, in E. 

By the above remarks we can assume A > 1. By using the proof of 
Proposition 11.7 we can assume ord,m; = 0 for 1 <i < A and ordym, > 0 
for A <1 < 2g. By passing to a constant field extension K, = KF,, the 
elements 7; are replaced by 7”. Using the proof of Lemma 11.7, we easily 
see that the degrees of Lx(u) modulo p and Lx, (u) modulo p are the 
same. By an appropriate choice of n we can insure 7% = 1 (mod P) for 
1<i< A. By passing to a further constant field extension Knm = KFnm, 
we can insure J[p] C J(Frm). This does not affect the congruences already 
established, so we can assume from the beginning that ord,(7;—1) > 0 for 
1<i<X and J[p] C J(F). 

Since J[{p] C J(F) it follows from the proof of Theorem 11.18 and its 
Corollary that ord, h( Kp») = yn + v for all n sufficiently large. We will 
now establish a similar formula with y replaced with 4. 

Recall that 


29 


h(Kpn) = [[a —1; ). 


i=l 
The terms in the product with \ < i < 2g are units since in this range 
ord,7; > 0. Thus, 


a 
ordy h(Kpn) = S— ord,(1— 7?) . (3) 
i=1 


In the proof of Proposition 11.5, we showed that if ord,(1 — 7;) > 0, then 
for all sufficiently large n, ord,(1 — 7? ") = n-+c;. It now follows from 


a 
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Equation 3 that ord, h( Kp») = An+c for all sufficiently large n, where 
C= wy Cj. 

We have now established that ord, h(Kpn) = yn+v = An + ¢ for all 
sufficiently large n. This can only happen if A = ¥. 


This theorem is well known, but the usual proof uses much more sophis- 
ticated methods, e.g., crystalline cohomology. The above proof is due to H. 
Stichtenoth. 


In the late 1950s, K. Iwasawa began publishing papers which carry over 
some of the theory developed in this chapter into algebraic number theory. 
The key idea is that constants in function fields are like roots of unity in 
number fields. To see why this is so, let K/F' be a function field over a field 
of constants F’. Let Cp > 1 be a constant and define for x € K and Pa 
prime divisor of K, 

\z|p _ Cyt) . 

It is easily checked that |x|p is a non-archimedean valuation of K; ice., 

it satisfies 


(a) |O|p =O (0b) \1lp=1 
(c) |zylp =|z|plylp (d) |z + y|p < max(|z|p, |y|p) . 


Up to a standard equivalence relation this set of valuations is a complete 
set of valuations of K. By Proposition 5.1, an element x € K™ is a constant 
if and only if ordp(x) = 0 for all P € Sx. This is equivalent to saying that 
x € K™ is a constant if and only if |z|p = 1 for all P € Sx. 

If K is an algebraic number field, let Ox be the ring of integers. If P is 
a maximal ideal in Ox and x € K, define 


|| p = N P7ordr (2) 


This is a non-archimedean valuation for each maximal ideal P C Ox. 
The equivalence classes of these valuations are called the finite primes of 
kK. In addition to the non-archimedean valuations there are finitely many 
archimedean valuations. These are obtained by imbedding K into the com- 
plex numbers and then applying the usual absolute value. By definition, 
these archimedean valuations correspond to primes at infinity. It is a well 
known theorem, due to Kronecker, that 2 € K* is a root of unity if and 
only if |z|p = 1 for all primes of K, both finite and infinite. 

From this discussion, it is clear that constants and roots of unity are 
analogous concepts, so it makes sense to think of cyclotomic extensions of 
a number field as analogous to constant field extensions of a function field. 
This is exactly what Iwasawa did. 

What is the analogue of the cyclic l-towers of constant field extensions 
we have considered in this chapter? Consider the cyclotomic field Q(¢jn+1) 
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where Cjn+1 is a primitive /”+1-th root of unity in a fixed algebraic closure 
Q of Q. The dimension of this field over Q is /"(J — 1) and there is a unique 
subfield Bj such that [By : Q| = Il”. The tower Q C Bi C Be C ... 
is called the basic Z, extension of Q. Note that Gal(Bij»/Q) is a cyclic 
group of order I”. Let Bjoo = US? Bin. The Galois group of Bj over Q is 
isomorphic to Z,, which is why this tower of fields is called a Z)-tower. 

If K is an arbitrary number field we define K)~ to be the compositum 
K Bj. The Galois group of Kj over K is isomorphic to Gal(Bjo/K M 
Bi) which is isomorphic to a subgroup of Z; of finite index and is thus 
isomorphic to Z;. It follows that K,.~ is a Z; extension of K and there is 
a tower of extensions K C K; C Kp Cc... Kio with Kin being a cyclic 
extension of K of degree 1”. The field Kj is taken to be the analogue of 
the constant field extension KF). It is called the cyclotomic Z, extension 
of K. 

One can now ask the question if there is an analogue of Proposition 11.5? 
Is there an asymptotic formula for ord; h( Aj.) in the number field case? It 
is a remarkable fact that the answer is yes. The zeta function of a number 
field is a much more complicated and mysterious object than that of a 
function field. Also, there is no obvious appeal to algebraic geometry as 
in the case of function fields. Nevertheless, Iwasawa was able to prove the 
following result (see Iwasawa [1] and [3], Lang [6], and Washington [1]). 


Theorem 11.21. (K. Iwasawa) Let K be an algebraic number field, | a 
prime number, and K,~ the cyclotomic Z, extension of K. Let h(Kin) be 
the class number of Kin. Then there are integers X1, 1,41, and no, such 
that for alln > no, 


ord; h( Kin) = Ain + yl” + . 


Actually, Iwasawa was able to prove this for any Z; extension of kK. An 
infinite number field L is said to be a Z extension of K if it is a Galois 
extension and Gal(L/K) & Z,. In general, there exist many such extensions 
in addition to the cyclotomic Z, extension. 

The formula given in the theorem is of precisely the same type as that 
given in Proposition 11.5 and the analogy is even more precise if uw, = 0. 
Iwasawa conjectured that 4; = 0 for the cyclotomic Z, extension. For the 
case of general number fields this is still an open question. However, if kK 
is a Galois extension of Q with Gal(K/Q) abelian, then Ferraro and L. 
Washington were able to show pu; = 0 (see Washington [1], Chapter 7), so 
in these cases the analogy is perfect. 

It is worth pointing out that j1; is not always zero. Iwasawa gave examples 
of non-cyclotomic Z; extensions where p, # 0 and even showed that py; can 
be made to be as large as you like (see Iwasawa [2]). 

Washington was also able to show that the analogue of Proposition 11.6 
is true for cyclotomic Z, of number fields (see Washington [1], Chapter 16). 
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There are many interesting open questions remaining in Iwasawa’s the- 
ory of number fields. There are also generalizations of these concepts to 
arithmetic-geometric contexts. For example, if A is an abelian variety over 
a number field K and L is a Z, extension of K, is there an asymptotic 
formula for the Mordell-Weil rank of A( Ki») as a function of n? This is the 
subject of a fascinating paper of B. Mazur [1]. However, we will have to be 
content with pointing out these directions and pass on to other matters. 


Exercises 


1. 


Let K/F be a function field over a finite field with q elements. Let 
Dx(u) = yd 9 anu” be the numerator of the zeta-function of K. 
Show that the functional equation (see Chapter 5) implies that 
qi-*a, = Qag-k forO<k <Q. 


. Suppose K’/F has genus 1 and that / is an odd prime dividing q — 1. 


Show that l|h, for some n dividing (1? — 1)/2. (Hint: If Lx(u) = 
(1—7,u)(1—7eu), show that ri = 7 or m2 (mod CL), in the notation 
of Proposition 11.1). This fact is due to J. Leitzel. 


. Let K/F be a function field of genus 1 over a finite field with q 


elements. Write Lx(u) = 1 — au + qu”. Suppose | is a prime such 
that (a? — 4q / 1) = 1 (the Legendre symbol). Show I|h, for some 
nll — 1. 


. Let A/F be a function field of genus 2 and characteristic p. Let h 


be the class number of K and N, the number of primes of degree 1. 
Suppose that h = N; (mod p) and that N; #1 (mod p). Show that 
p|h» for some integer n dividing p — 1. (Hint: Make use of the proof 
of Proposition 11.8.) 


. Prove the case | = 2 of Lemma 11.17. 


. Let A/F be a function field over a finite field and let J be the as- 


sociated divisor class group over the algebraic closure of F. Suppose 
| # 2 is a regular prime in this situation, i.e., [F(J[l]) : F] is prime 
to !. Use Lemma 11.17 to show directly, i.e., not as a corollary to 
Theorem 11.18, that ord; h( Kin) = ryn + ord; h(’), where 7; is the 
dimension over Z/IZ of J(F)|I]. 


. Let F be a field with q elements and K = F(z, y) be a function field 


with generators x and y which satisfy Y*? = f(X), where f(X) € 
F[X] is a cubic polynomial without repeated roots. Assume that g 
is odd. Prove that 2 is an irregular prime if and only if f(X) is the 
product of a linear factor and an irreducible quadratic. (One needs 
some elementary facts about points of order 2 on an elliptic curve). 
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8. Suppose F = Z/5Z and K = F(z,y), where x and y satisfy Y? = 
X? — 3X. This is a curve of genus 1. The primes of degree 1 of K 
are in one-to-one correspondence with the solutions of this equation 
over F together with one prime at infinity. Use this to show Ni(K) = 
h(K) = 2 and deduce Lx(u) = 1 — 4u + 5u*. Deduce from this that 
(we drop K from the notation): N; = 2, No = 275, N4 = 25, and 
Ng = 29375 - 17. 


9, (Continuation) One can show that all the points of order dividing 4 
are in J(F4). Use this and the previous exercise to show 


J (Fo) )(2) & 2-I-'Z/Z @ 2°75 Z/Z , 
for all 7 > 2. 


10. (Continuation) Show [F(J[5]) : F] = 2 and deduce that J(Fs,)(5) = 
(0) for 7 > 0, J(Fo.5))(5) & Z/59Z for 7 > 0, and J(Fo;)(5) & Z/5Z 
for 7 > 1. 


12 


Cyclotomic Function Fields 


In the last chapter we explored the arithmetic of constant field extensions 
and noted (as was pointed out by Iwasawa) that these extensions can be 
thought of as function field analogues of cyclotomic extensions of number 
fields. This analogy led to various conjectures about the behavior of class 
groups in number fields which have proved very fruitful for the development 
of algebraic number theory and arithmetic geometry. There is another func- 
tion field analogy to cyclotomic number fields which was first discovered 
by L. Carlitz [3] in the late 1930s. This ingenious analogy was not well 
known until D. Hayes, in 1973, published an exposition of Carlitz’s idea 
and showed that it provided an explicit class field theory for the rational 
function field (see Hayes [1]). Later developments, due independently to 
Hayes and V. Drinfeld, showed that Carlitz’s ideas can be generalized to 
provide an explicit class field theory for any global function field, i.e., an 
explicit construction of all abelian extensions of such a field (see Drinfeld 
[1] and Hayes [2]). This is a complete solution to Hilbert’s 9-th problem in 
the function field case. Nothing remotely so satisfying is known for num- 
ber fields except for the field of rational numbers (cyclotomic theory) and 
imaginary quadratic number fields (the theory of complex multiplication). 

It is interesting to note that this discussion shows how the power of the 
number field-function field analogy is useful in both directions. The theory 
of constant field extensions in function fields gave rise to Iwasawa theory in 
number fields. The extensive theory of cyclotomic number fields gave rise 
to the work of Carlitz, Drinfeld, and Hayes which provided a way explicitly 
to construct all abelian extensions of a global function field. 
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The impetus to function field arithmetic given by these developments has 
led to many new ideas and developments. One direction, which we will not 
be able to discuss, is the invention of characteristic p-valued L-functions by 
D. Goss. These functions share many properties of their classical analogues. 
In particular, their special values can be related to the arithmetic of fields 
generated by adding torsion points on Drinfeld modules. On the other hand, 
they do not seem to possess functional equations. For a survey of these 
developments, see Goss {2]. Another interesting reference is Thakur [2]. 
For a more comprehensive treatment, one should consult the treatise by 
Goss [4]. 

In this chapter we will deal almost exclusively with Carlitz’s construction 
of what we will call cyclotomic function fields. This is a special case of far 
more general constructions, but it contains most of the features of the 
general theory and has the advantage of being very down to earth and very 
close to our initial theme in this book; the analogy between the rational 
integers Z and the ring of polynomials over a finite field A = F/T]. 


We begin by recalling, mostly without proof, several features of the the- 
ory of cyclotomic number fields. Let m > 2 be a positive integer and ¢,, € C 
a primitive m-th root of unity. Then, the field K,, := Q(Gn) is called the 
m-th cyclotomic number field. It is the splitting field of x™ — 1 € Q|z], so 
it is a Galois extension of Q. If o € Gal(Ky,/Q), then o(¢,,) = ¢%,, where 
a is relatively prime to m and is only determined modulo m. This gives 
rise to a monomorphism Gal( K,,/Q) — (Z/mZ)*. This map can be shown 
to be onto (the irreducibility of the m-th cyclotomic polynomial). Thus, 
Gal(K,,/Q) = (Z/mZ)*. It follows that K,,/Q is an abelian extension of 
degree ¢(m), where ¢ is the Euler ¢-function. 

If a € (Z/mZ)*, we denote by a, the corresponding automorphism. It is 
characterized by 


Fa(Cm) = Cm 


There are two important consequences of this. The first is immediate. 
Namely, o_; is complex conjugation on Ky. Indeed, a~1(Gn) = C71 = Cm, 
and Gm, by definition, generates Q(¢,,) over Q. The second consequence is 
that if p > 0 is a prime not dividing m, then co, is the Artin automorphism 
for the prime ideal pZ. To see this, we must first investigate ramification 
in K,, and learn something about the ring of integers O,, of Kin. 

Consider first the case when m = p® is a prime power. Set Cpe = ¢. Since 
¢ satisfies the polynomial f(x) = 2? —1 and f’(¢) = p®¢? —! one can 
deduce that K,-/Q is unramified at all primes different from p. We claim 
it is totally ramified at p and that the prime ideal lying above pZ in Op is 
just (¢ — 1). Here is a sketch of the proof. Let a € Z be relatively prime to 
p. There is ab € Z such that ab = 1 (mod p*®). One has 

Co — 


1 
ous el ed ae ae 
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and 
¢-1 (1 
(eT Cen] 
It follows that ¢* —1/¢—1 is a unit in O,,. Now, the irreducible polynomial 
in Q|z] for ¢ is 


_ (¢7)e-! 4 (C7)o-27 4 4 C44 


cP —] 


epo-* — | 


g(x) = = (xP POE (@P PoP fe pa EL 
The irreducible polynomial for ¢ — 1 is thus g(x + 1), which has constant 
term p. The other roots of g(x +1) are ¢* — 1, where 1 < a < p® and 


(a,p) = 1. Thus, 


€ 


P 
p= [J ¢*-1)=(¢-1)9 x unit . 


a=l 
(a,p)=1 


Passing to ideals in O,,, we find pO, = (¢—1)%"). Since [Km : Q] = ¢(m), 
this can only happen if (¢ — 1) is a prime ideal in O,,, which shows that 
pZ is totally ramified, as asserted. 

We continue to assume that m = p® and set ¢,- = ¢. Under these cir- 
cumstances we claim O,, = Z|¢]. To this end we note that the discriminant 
of the ring Z[¢] over Z is a power of p. This is a calculation (see Lang [5]). 
Note that Z|¢] = Z|¢ — 1]. Let w € Oye and write 


o(p*)—1 | 
W = S- ai(¢—-1)", a €Q 


1=0 


From the usual deduction via discriminants and Cramer’s rule we find that 
the rational numbers a; have denominators a power of p. We want to show 
the denominators are +1 so that the a; € Z. Let the least common multiple 
of the denominators be p” with n > 0. Then a; = b;/p” with the b; € Z 
and not all the b; divisible by p. We have 


Extend ord, to Kye by writing ord,(a) = $(p*)~*ord(¢_4)(q@) for all a € 
Kye. Let 19 be the smallest integer such that ord,(b;) = 0. Then, as is easily 
seen, ord, of the right-hand side of the above equation is ig/¢(p*) < 1. On 
the other hand, ord, of the left-hand side is > n. This shows n = 0 and it 
follows that all the a; are integers, as required. 

In the general case, write out the prime decomposition of m, m = 


Pips? --- py’. We require that m not be twice an odd integer. This is not 
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a big restriction since if mo is odd, Kom, = Km,. Then, Am is the com- 
positum of the fields K,,«, . It follows that all the p, ramify in K,, and all 
other primes are unramified in K,,. Moreover, using what we have shown 
in the prime power case it follows that Om = Z[¢,,] (see Lang [5]). 

We now can prove what was promised earlier. Namely, if p > 0 does not 
divide m, then the Artin automorphism of the prime ideal P = pZ for the 
extension K,,/Q is precisely 0, provided ptm. Let 8 be a prime ideal in 
Om lying over P. The Artin automorphism for P is characterized by the 
congruence 


(P, Km/Q) w=wX? (mod B) Yue Om . 


If p > 0 is prime and P = pZ, then NP = p. Since O,, = Z[G,,] we can 
check the congruence on elements of the form Sait, where the a; € Z 
and the sum is from 0 to ¢(m) — 1. We calculate 


or(>_ aiGn) = S- axch = (S- aicm)” (mod §) . 


This shows that (P, K,,/Q) = dp as asserted. 

From this fact about gp one can calculate the way primes in Z split in 
Km. If P = pZ is unramified, then P splits into ¢(m)/f primes of degree f 
in A,,, where f is the order of (P, K,/Q) in Gal(K,,/Q) (see Proposition 
9.10). Since we have shown Gal(K,,/Q) = (Z/mZ)*, the order of o, in 
Gal(K/Q) is the smallest integer f such that p! = 1 (mod m). 

We summarize a portion of this discussion in a theorem. 


Theorem 12.1. Let m > 0 be an integer not equal to twice an odd number. 
Let Gm € C be a primitive m-th root of unity and Km = Q(Gm). Then Km/Q 
is an abelian extension of degree ¢(m). The Galois group is isomorphic to 
(Z/mZ)*. A rational prime p is ramified in K», if and only if p|m. If 
p > 0 does not divide m, the Artin automorphism corresponding to the 
prime ideal P = pZ takes ¢,, to CP,. Let f be the smallest positive integer 
such that p!) = 1 (mod m). Then, P = pZ splits into ¢(m)/f primes of 
degree f in K,. Finally, if O,, denotes the ring of integers in Km, then 
Om = ZlCn)- 


The last thing about cyclotomic fields which we wish to discuss at this 
point is the behavior of the prime at infinity. The field of rational numbers 
Q has only one archimedean prime given by the usual absolute value. The 
field K,, is such that every embedding into C is complex since the only 
roots of unity in the real numbers R are +1. Consider the subfield K7, = 
Q(¢m + G71). This field is real and so is every embedding of it into the 
complex numbers. Moreover, it is of index 2 in K,, since ¢,, satisfied the 
quadratic equation x? — (¢, + ¢>!)r +1 = 0. Thus, the prime at infinity 
in Q splits into ¢(m)/2 real primes in K;* and each of these ramifies to 
a complex prime in K,,. It is clear that the Galois group of K,,/K;, is 
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generated by o_; which is complex conjugation. Thus, g_; can be thought 
of as generating the inertia group of the primes at infinity in Ky. 


Having now reviewed the cyclotomic theory in the number field case 
we will next consider how to construct an analogous theory in the function 
field case. Considering roots of unity will yield only constant field extensions 
which, as we have seen, are everywhere unramified. How can we generate 
abelian extensions which are geometric? The answer is not at all obvious. 
To provide the necessary background, we begin by exploring the notion of 
an additive polynomial over a field. 

Let k be a field. A polynomial f(x) € k{z] is said to be additive if inside 
the polynomial ring in two variables k[z, y] we have f(c+y) = f(x) + f(y). 
For any element a € k, f(x) = az is such a polynomial. We shall see that 
in characteristic zero this collection of homogeneous linear polynomials 
constitutes all additive polynomials. In characteristic p > 0 the polynomial 
T(x) = x? is additive, as is easily seen using the binomial theorem. It is 
easy to check that the set of additive polynomials is closed under addition, 
subtraction, multiplication by elements of k, and composition. The last is 
seen from the calculation 


f(g(x+y)) = flg(z) + g(y)) 
f(g(x)) + flay) = (f° g)(@) + f° g)y) - 
This leads us to additive polynomials of the form anv +a x2? +++++a,2? . 


With these we have exhausted the collection of additive polynomials as we 
now show. 


(fog)(z+y) 


Proposition 12.2. Let k be a field and f(x) € k[x| an additive polynomial. 
If the characteristic of k is zero then f(x) = ax for some a € k. If the 
characteristic of k is p > 0, then there are elements a, Ek witthO<i<r 
such that f(x) = apxz +a,2P +--+ a,x? . 


Proof. By definition, if f(x) is additive, f(x +y) = f(x) + f(y). Take the 
formal partial derivative with respect to x. Then, 0,f(x +y) = Ozf(z). 
Setting z = 0 we see that the formal derivative of f is a constant. If 
f(x) = Sobjz*, then f’(x) = S>ibj;x*—!. In characteristic zero this shows 
that f’(x) is a constant if and only if f(v) = bo +612, a linear polynomial. 
However, f(z + y) = f(x) + f(y) implies f(0) = 0. Thus, in this case 

Now, if the characteristic of k is p > 0, then f’(x) is a constant if and 
only if b; = 0 for all i > 1 with 7 not divisible by p. We may write 


f(a) = byx + So bpjx? = byw + g(x)? , 


j21 


where g(x) has coefficients in the field k, obtained from k by adjoining the 
p-th roots of the coefficients b,;. It is a simple matter to check that g(z) is 
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additive in k,[x]. By induction on the degree of f(x) we can assume that 
h 
9(Z) = Vendo Caz” . Thus, 


f(x) = br + S> cPaP ; 


h>0 
which is a polynomial of the required form since c; € & for all h. 


From now on, we assume that we are working in characteristic p > 0. 
Suppose that k is a field of characteristic p and let A(k) denote the set of 
additive polynomials with coefficients in k. A(k) is easily seen to be a ring 
with addition being given by the standard addition of polynomials and 
multiplication being given by composition (as is easily seen, A(k) is not 
closed under ordinary multiplication). We will reformulate the structure 
of A(k) in a more convenient manner by associating with every additive 
polynomial 


8 


Clearly, f(x) = g(T)(x) and the map f(x) — g(r) sets up a bijection 
between A(k) and k < 7 >, the ring of polynomials in 7 with “twisted” 
multiplication. This means that for all a € k we have 


Ta=aPr. (1) 
This follows from the calculation, 
(Ta)(xz) = T(ax) = (ax)? = aPax? = (aPr) (x) . 


Thus, multiplication in k < 7 > is just like that in a polynomial ring 
except that when multiplying an element of k by a power of 7 one must 
use the Relation 1. For obvious reasons, k < 7 > is often referred to as a 
twisted polynomial ring. It is now easy to check that the ring of additive 
polynomials with coefficients in k is isomorphic to k < 7 > under the map 
f(x) — g(r) given above. We will work primarily with k <7 >. 

It is possible, and desirable, to give a group scheme interpretation to this 
ring. Let G,/k be the additive group scheme over k. Among other things, 
G,/k assigns to every commutative k-algebra B the underlying additive 
group B,. Every additive polynomial gives rise to an endomorphism of By, 
in the obvious way. If u€ B and Sja;t* Ek <7 >, then 


(> air") (u) = S- aur 
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From these considerations it is not hard to show End(G,g/k) =k <7 >. In 
what follows, we will identify these rings. We will not need to invoke any 
facts from the theory of group schemes, but this point of view is illuminat- 
ing. All of the theory which we will develop is made possible by the fact 
that in characteristic p, End(G,/k) is a “big” ring. 

We need to make one modification in these definitions before returning 
to function fields. Let F be a finite field with g = p* elements. We want 
to work only with F-algebras and we want our endomorphisms to respect 
the F-algebra structure. So we assume F C k and only look at additive 
polynomials f(z) which commute with the elements of F. This requirement 
is that f(ax) = af (x) for alla € F. If f = > a;r* this requirement is easily 
seen to be equivalent to 


a? =a VaeF whenever a; £0. 


From the theory of finite fields, we see that these conditions hold if and 
only if s|¢ for all 2 such that a; # 0. Another way of saying this is that 
fek<r*® >. Note that r°(2) = £9. Since F will be fixed in our further 
considerations, we will redefine 7 to be the mapping which raises to the 
q-th power and write 


Endg(G,/k) =k<T>, 
where now the fundamental commutation Relation 1 will be replaced by 
ta=a'r Vaek. (2) 


As usual, set A = F[T| and k = F(T). 


Definition. A Drinfeld module for A defined over k will be an F-algebra 
homomorphism p: A > k <7 > such that for all a € A the constant term 
of po is a and, moreover, for at least one a € A, fa €¢ k. 


The notion of a Drinfeld module is much more general, but for the pur- 
poses of this chapter, this definition will suffice. The idea behind the defini- 
tion is that given a Drinfeld module p every commutative k-algebra B can 
be made into an A algebra in a new way. B is already an A-algebra since 
A is a subset of k and B is a k-algebra. However, given p we can define a 
new multiplication by 


a-u=p,(u) VaeA and VWeB. 


The condition that p, ¢ k for at least one a € A is to insure that this 
action is indeed different from the standard action of A on B. We will call 
B with this new A-module structure, B,. The k-algebra which will receive 
the most attention is the algebraic closure of k, k. 

We have said nothing yet about the existence of Drinfeld modules. In 
the general case (which we have yet to define) this is a delicate question. 
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Here, however, it is a trivial matter. A is generated freely as an algebra 
over F by one element T. Thus, for any element h € k < 7 > there is 
a unique homomorphism from A to k < +t > which takes T to h. We 
must only make sure that the constant term of h is T and that h ¢ k to 
get a Drinfeld module. Perhaps the simplest choice for h is 7+ 7. The 
resulting Drinfeld module is called the Carlitz module, C. Thus, Crp = 
T+, Cr =T*+(T+T7%)7 4+ 7?, etc. Carlitz discovered and exploited 
this module decades before anyone else had any idea of the value of this 
construction. The reader may wish to consult the papers by Carlitz [1, 2]. 
In these papers Carlitz actually works with the module C” defined by the 
relation C’(T) = T — 7. In almost all modern treatment the plus sign is 
chosen. We shall stay with this convention. 

We will discuss the properties of the Carlitz module in some detail, but 
for a while we will continue to develop the theory more generally. 

Suppose p is a Drinfeld module and 


pp =T+eatt+eoT? +++: +07" , 


where the c; € k and c, # 0. Using pr2 = prpr, we see that the constant 
term of pr2 is T* and that the highest power of 7 that occurs is 2r and 
the leading coefficient is c, raised to the power 1 + q”. Continuing this 
way we find that the constant term of prn is T” and the highest power 
of r that occurs is nr and the leading coefficient is c, raised to the power 
D+q+q77 +---+ qq)", Using these comments and the fact that p is 
an F-algebra homomorphism we find for a € A that the constant term of 
Pa is a and the degree in 7 of p, is rdeg(a). It is important to note that 
the degree of the polynomial p,(z) is q” 9°‘). Under these conditions, we 
say that the Drinfeld module p has rank r. We shall now see how the rank 
plays an important role in the theory of the A-module kp. 
Let’s consider the A-module kp and its torsion submodule: 


Ap ={XEk| pal(A) = 0 for somea€ Aja #0}. 
For any a € A, a £0, we define the submodule A,[a] C A, as follows: 
Apa] = {1 €& | pa(A) = 0}. 


It is possible to identify the A-module structure of these modules with 
some precision. The following abstract lemma is the key. 


Lemma 12.3. Leta € A, a £4 0. Let M be an A-module and suppose 
for each bla that the submodule M[b] = {m € M | bm = 0} has q” deg(b) 
elements. Then 


M{a] & A/aA@A/aA®::-@A/aA r times. 


Proof. Consider the prime decomposition of a, a = aP;' Ps? .-- Pf*, where 
a € F* and the P; run through the monic, irreducible divisors of a. M[a] is 
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isomorphic to the direct sum of the submodules M[P,"|. Via the Chinese 
Remainder Theorem, it suffices to consider the case that a = P® is a prime 
power. 

So, suppose a = P® is a prime power. Since M|[P] is a vector space over 
A/PA with q’ 48) elements, by hypothesis, it follows that the dimension 
of M[P] over A/PA is r (recall that A/PA has q*°&” elements). It follows 
from the structure of modules over principal ideal domains that M|P*] is 
a sum of r cyclic submodules, 


M[P*?) 2 A/P"A@A/PPA®::-@A/PIA. 


One must have f; < e for 1 <1<_r. The number of elements in the right- 
hand side of this isomorphism is g to the power (f; + fa+-::+f,) deg(P). 
The number of elements in the left-hand side is, by hypothesis, q to the 
power redeg(P). These two numbers being equal implies that each f; = e 
and this concludes the proof. 


Proposition 12.4. Let p be a Drinfeld module of rank r, 1.e., for each 
a € A the degree in T of pa is rdeg(a). Then, for eacha € A,a #0 we 
have 


A,|a] = A/aA@ A/aA®:--@A/aA r times. 


For the module A, we have the isomorphism 
A, =k/A@®k/A®::-:@k/A_ r times . 


Proof. We apply Lemma 12.3 with M = ko. We have to check that for 
each a £0 in A that A,[a] has q”¢°8 elements. From our previous work 
we see that p,(xz) has the form 


r deg(a) 


pa(x) = ax + byx% + bor? + +++ + Brdeg(ay®! : 


where the b; € k and 6, deg(a) # 0. The derivative of pa(x) with respect 
to z isa #0. Thus, p,(2z) is a separable polynomial and in k has gq” ¢¢2(@) 
distinct roots. These roots are exactly the elements of A,/a] so the first 
part of the proof is complete. 

The second assertion is a formal consequence of the first. Since we won’t 
use it in what follows, we merely sketch the proof. Note first that A, is the 
union of the submodules A,[a] as a runs through the non-zero elements of 
A. Secondly, since A/aA & a~!A/A we can rewrite the first isomorphism 
as 


A,[a] Xa7'A/A@a'A/A®-:-@a tA/A_ r times. 


Order the non-zero elements of A by divisibility. The result would follow 
if we could pass to the direct limit and this process could be done in such 
a way that the direct limit could be interchanged with the direct sums. 
One can arrange the direct sum decompositions so that this is possible. 
However, we will omit the details. 
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Suppose we adjoin the elements of A,/a] to k to form the field Kp := 
k(A,[a]). Since, as we have seen, pa(x) is a separable polynomial and Apia 
is the set of roots of this polynomial, it follows that K,./k is a Galois 
extension. Since pa(x) € k{x] we see pq(A) = 0 implies pg(aA) = 0 for all 
o € Gal(K,,a/k). For such o it is easy to check that not only does o map 
A,[a] into itself, it actually induces an automorphism of the A/aA-module 
structure. We thus get a map, in fact a homomorphism, from 


Gal(K,pa/k) > Autasaa(Apla}) . 


Since A,|a] generates K,., any automorphism inducing the identity map 
on A,[a] must be the identity automorphism. Thus, the kernel of the above 
map is trivial. 

Finally, by the first assertion of Proposition 12.4, we see that 


Aut4/aa(Ap|a]) = GL,(A/aA) . 


We have proved the following proposition. 


Proposition 12.5. Define Kp. to be the field k(A,|a]). Then K,a/k is a 


Galois extension and there is a monomorphism 


Gal(Ky,a/k) + GL,(A/aA) . 


Corollary. [f p has rank 1, then K,.4/k ts an abelian extension. 


Proof. This is immediate from the Proposition since GL;(A/aA) = (A/aA)* 
which is abelian. 


One can ask about the size and nature of the image of the maps given in 
the Proposition. This is a very difficult question in general. Much remains 
to be discovered. Using a lot of sophisticated machinery some answers have 
recently been given by Richard Pink (see Pink [1]). Here, we will be con- 
cerned with a very special, but interesting case. Namely, the case of the 
Carlitz module. 

Recall that the Carlitz module is characterized by Cr = 7'+T or equiv- 
alently Cp(z) = Tx+-2?. Clearly the Carlitz module has rank 1 and so, by 
the corollary to Proposition 12.5, adjoining torsion points for the Carlitz 
module to k gives rise to abelian extensions. We will investigate these exten- 
sions in some detail and show that they have remarkably similar behavior 
to cyclotomic extensions of Q. 

Since the Carlitz module will be the focus of our work for the rest of this 
chapter, we will write A for Ac , Aq for Ac[a], and K, for Kc,q. Also, to 
emphasize the relation to our discussion of cyclotomic fields we will use the 
letter “m” from now on rather than “a” as our typical non-zero element 
of A. The fields Ky, = k(Am) will be the analogues of cyclotomic number 
fields. We define them to be cyclotomic function fields. 
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By Proposition 12.5 and its corollary we see that Gal(Ky,/k) is iso- 
morphic to a subgroup of (A/mA)*. Our first goal will be to show it is 
isomorphic to all of (A/mA)*. Before doing this it will be necessary to 
make a number of preliminary observations. 

Notice that Com = CaCm = aC, for all a € F. It follows thats that 
Aam = Am for all a € F*. Another way of saying this is that the torsion 
points A,,, depend only on the ideal mA and not on any particular generator 
of this ideal. 

Let m be a polynomial of degree d. Then 


Cyn (x) = [m, O]x + [m, Ix? + [m,2]x% +++ +[m,djz™ , (3) 


where |m, 2] € A for every i, [m,0] = m, and [m, d] is the leading coefficient 
of m. Note that if m is monic, then [m,d] = 1. The degree of C,,(x) is 
q? = |m| (see Chapter 1 for a discussion of this notation). Later we will 
show that as a polynomial in T, [m,i] has degree q‘(d — 1). It is a good 
exercise to compute C,,,(x) explicitly for a few polynomials m of small 
degree to get a feel for the nature of the coefficients [m, i]. 

It will turn out that ifm = P, a irreducible polynomial, then Cp(x)/z is 
an Eisenstein polynomial at P (i.e., the leading coefficient is not divisible by 
P, all the other coefficients are divisible by P, and the constant term is not 
divisible by P*). Thus, Cp(x)/x € A[z] is analogous to [(1+ 2)? —1]/x € 
Z|r] in the classical cyclotomic theory. It follows that 0 4 Ap € Ap is 
analogous to ¢, — 1, not ¢,. 

From Proposition 12.5, we know that Am & A/mA as an A-module. Let 
Am be a generator of this module. Then, it is easy to see that Co(Am) 
is a generator if and only if (a,m) = 1. This shows that A,, has ®(m) 
generators where ®(m) is the analogue of the Euler ¢-function for the ring 
A. By definition, 6(m) is the number of non-zero polynomials in A of degree 
less than that of m and relatively prime to m. Alternatively, ®(m) is the 
number of elements in (A/mA)* (see Chapter 1). 

Since A, is a generator of A,, it follows that K,, = k(Am). Let Om 
denote the integral closure of A in Ki. 


Proposition 12.6. Let Am € Am be a generator and suppose a € A is 
relatively prime tom. Then, Ca(Am)/Am is a unit in Om. If m is divisible 
by two or more primes, then Am is itself a unit. 


Proof. From Equation 3 we see that A,, is integral over A. From the same 
equation, replacing m by a, d by deg(a), and substituting x = Am, we see 
that Ca(Am)/Am € Om. We must show the reciprocal of this element is 
also in O,,. 

Let b € A be such that ba = 1 (mod m). There is an element f € A such 
that ba = 1+ fm and we have C,C, = 1+ CyC,. Applying this to A», 
yields Cy(Ca(Am)) = Am. Thus, 


204 Michael Rosen 


To prove the second assertion, it is no loss of generality to assume m is 
monic. Suppose m = m,m2, where m, and mg are monic and relatively 
prime. Set Am, = Cm,(Am) and Am, = Cm,(Am). Then Am, is a primitve 
m,-th torsion point for 1 = 1,2. For all a € A, Ca(zx) is divisible by a. 
Consider the factorization 


This shows that A, divides \,,, and similarly A,, divides Am, in Om. 
Taking norms from K,, to k shows that the norm of A, divides a power of 
Nx, /k(Am,) for 1 = 1, 2. 

To finish the proof one does induction on the number of distinct primes 
dividing m. We will need the corollary to Proposition 12.7, which will be 
proven shortly. Its proof is independent of what we are doing here, so it is 
legitimate to use it. The corollary implies that if m= P® is a prime power, 
then the norm of Ape is P. Suppose m is a product of two prime powers 
Py" and P;?. Then, from what we have proven, it follows that the norm 
of A,, divides a power of P; and a power of P,. This implies the norm of 
Am is a non-zero constant and so A», is a unit. If m is divisible by t > 2 
distinct primes, set 


t 
my, = Py and m2= Ea , 
i=2 


Then, by induction, A,,, is a unit and its norm is a non-zero constant. By 
what we have proven above, it follows that the norm of .,, is a non-zero 
constant. ‘Thus, A,, is a unit, and we are done. 


With the aid of these units we will imitate some of the deductions of the 
classical theory. As there, we begin by considering the case when m = P® 
is a power of an irreducible polynomial P. Since Ape © A/P*A an element 
A € Ape is a primitive generator iff Cpe(A) = 0 and Cpe-1(A) 4 0. Thus, 
the primitive generators are precisely the roots of the polynomial 


Cpe(z)  Cp(Cpe-1(z)) 


C'pe-1(x) Cpe-1(2) 
P+[P,iJCpe-1(x)?-1 +--+ + [P,d]Cpe-1(x)%~" , (4) 


| 


where d = deg(P). The degree in x of the polynomial in Equation 4 is 
|P\e-1(q? — 1) = |P\*-!(|P| — 1) = 6(P®) as it should be. 

Proposition 12.7. Let P € A be a monic irreducible polynomial and 
e € Z,e > 0. Then, Kpe is unramified at every prime ideal QA with 


QA # PA. The prime PA is totally ramified with ramification index b(P°). 
Consequently, 


[Kpe : k] = @(P°) and Gal(Kp./k) & (A/P*A)* . 
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Finally, the prime ideal above PA is (A) = AOpe where X is any generator 
of Ape . 


Proof. Let \ be a primitive generator of Ape and let g(x) € k[z| be the 
monic irreducible polynomial it satisfies. Then g(a) must divide Cpe(z). 
Write Cpe(x) = f(x)g(x). Differentiate both sides and substitute x = 4. 
The result is P® = f(A)g'(A). Since Kpe = k(A), it follows that g’(A) is 
contained in the different of Op. /A. Thus any prime ideal of Ope dividing 
the different, must contain a power of P and thus P itself. This shows that 
PA is the only possible prime ideal in A ramified in Ope. 

Let d = deg(P). As we have seen, the other primitive generators of Ape 
are 


{Ca(A) | 0 < deg(a) < deg(P*) = ed and (a, P) = 1}. 


These are the roots of the polynomial in Equation 4, which is monic (since 
P is assumed to be monic) and has constant term P. By the first part of 
Proposition 12.6, we deduce 


P= [J Gd) =A?) x unit . 


a,deg(a)< ed 
(a,P)=1 


It follows that PA = (\)®(P"), Let $B be a prime ideal of Ope dividing (,). 
Then, the ramification index of %/P is divisible by ®(P*). Since A is a 
root of the polynomial in Equation 4, we know that [Kpe : k] < ®(P°). It 
follows that the ramification index of 8/P is precisely (P°), that POpe = 
pe(P") and that $B = (A). The remaining assertions are now clear. 


Corollary. Let P € A be monic irreducible of positive degree, and e € 
ZL, e > 0. Let » be a generator of Ape and g(x) € kia] its irreducible 
polynomial. Then g(x) is an Eisenstein polynomial at P. 


Proof. By what has been proven in the proposition, g(x) is the polynomial 
which appears in Equation 4. We have 


g(x)= |] (@-C.(d)), 


(a,P)=1 


where the product is over all primitive generators of Ape. 

Except for the leading coefficient, which is 1, the coefficients of g are 
the elementary symmetric functions of the primitive elements in Ape. The 
proposition shows these are all in the ideal (A). Thus, all the coefficients of 
g(x), except the leading coefficient, are in (A)N. A = PA. Since the constant 
term is P, it follows that g(x) is an Eisenstein polynomial. 


Using the above corollary and an easy induction on e, we see that Cpe (x) 
is a product of Eisenstein polynomials at P. Consequently, all its non- 
leading coefficients must be divisible by P. In other words, we have shown 
P | [P°, 2] for all 0 <i < ed where d = deg(P). 
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Having dealt with the case m = P®, we now pass on to the general case. 
Let m € A be a polynomial of positive degree and let m = aPy! Py? - ++ Py 
be its prime decomposition. 


Theorem 12.8. K,, is the compositum of the fields K pe. The only ideals 
in A ramified in O, are P;A with1 <1<t. We have {Kin : k] = ®(m). 


More precisely, 


Gal(K,,/k) & (A/mA)* 


Proof. Define m; to be m divided by P**. Let Am be a generator of Aj, as 
an A-module. It is clear that C,,(Am) is a generator of Ape. Set Apes: = 
Cm, (Am). 

Clearly, Kpe. = k(Aps.) C k(Am) = Km. Thus, Ky, contains the com- 
positum of the fields K pe .,forl <i<t. 

Since the greatest common divisor of the set {m; | 1 <7 < t} is just 1, 
there exist polynomials a; € A such that 1 = an a,;m,. It follows that 
1 = ar Ca,Cm,. Applying this relation to the element A,, yields 


\m = » Ca, (Ape) - (5) 


This shows that A,, is in the compositum of the fields K pe) which com- 
pletes the proof that K,, is the compositum of these fields. 

If P is a prime element such that PA 4 P;A for any 1, then by Proposition 
12.7, PA is unramified in every K pe. and so must be unramified in their 
compositum K,,. On the other hand, P,A is totally ramified in K pe by 
the same proposition. Thus, all the ideals P,A are ramified in Ky. — 

We will prove that [K,, : k] = ®(m) by induction on t. For ¢ = 1 this 
assertion is part of Proposition 12.7. Assume the result is true for t — 1. 
Then, [Km, : k] = ®(mz). Now, Kn, 1K pe = k because Ky, is unramified 
at PA and K pet is totally ramified at PA. It follows that 


[Kin 2k] = (Km, | k][Kpee : k] = ®(m1)®(Py*) = O(m) . 


Finally, we know from the corollary to Proposition 12.5 that there is 
a monomorphism from Gal(K,,/k) to (A/mA)*. Since we now know that 
both of these groups have the same order, &(m), it follows that this monomor- 
phism is an isomorphism. 


Our next task is to investigate how the primes in A split in O,,. To 
do this we have to look at the isomorphism Gal(K,,/k) = (A/mA)* more 
closely. 

We first recall how this isomorphism is defined. As usual, let 4, denote 
a generator of A,, as an A-module. If o € Gal(K,,/k), then clearly om is 
another such generator. Thus, there is an a € A with (a,m) = 1 such that 
o(Am) = Ca(Am). The automorphism o is completely determined by this 
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relation since \,, generates K,, over k. Note that a is determined only up to 
a multiple of m. We write o = o,. The map o — a is the isomorphism from 
Gal(Km/k) — (A/mA)* which we have been discussing. The content of 
Theorem 12.8 is that for any a € A, relatively prime to m, there is a unique 
automorphism og € Gal(K,,/k) such that agAm = Ca(Am). The important 
fact that we are after is that when P is a monic, irreducible polynomial 
not dividing m, then op = (PA, Ky,/k), the Artin automorphism for the 
prime ideal PA. The next proposition, interesting in itself, will be a useful 
tool. 


Proposition 12.9. Let O,, be the integral closure of A in Km. Then, 
Om = A[Am|]- 


Proof. We first consider the case when m = P*®, i.e., when ™m is a prime 
power. 

For the moment, let’s drop the subscript and set Ape = A. We have that 
A{A] C Ope and we want to show equality holds. Let g(x) € k{x] be the 
irreducible polynomial for A. We showed at the beginning of the proof of 
Proposition 12.7 that g’(A) divided a power of P in Ope. It is astandard fact 
about Dedekind domains that the discriminant of the A-order A[A] C Kpe 
is the norm from Kp. to k of the element g’(A) (see Serre [2]). It follows 
that the discriminant of A[A] is a constant times a power of P. 

Let w € Ope. Then 

@(P*)—1 
W) = 3 ar , 
4=0 
where a; € k for 0 <2 < (P°). Using the fact that the discriminant 
of A[A] is a constant times a power of P, we see in the usual way that 
each a; is if the form b;/P", where b; € A and n can be chosen so that at 
least one of the b; is not divisible by P. We want to show that under these 
circumstances, n = 0, so that w € A[A]. From the last equation, we find 


6(P*)-1 


P™w= So dBA‘. (6) 
t=0 


Extend the additive valuation ordp from k to K,, by the equation ordp = 
(P°)~tordy). Recall that by Proposition 12.7, (A) is a prime ideal and it 
is totally ramified over PA. Thus, this procedure makes good sense. Also, 
ordp(A) = 1/®(P*). 

In Equation 6, ordp of the left-hand side is > n. Let ig < ®(P*) be 
the smallest non-negative integer such that ordp(b;,) = 0. A moment’s 
reflection will then establish that the zo-th term on the right-hand side of 
Equation 6 has the smallest valuation of all the terms. Thus, the ordp of 
the right-hand side is i9/®@(P*°) < 1. It follows that n = 0 and sow € A[A| 
as asserted. 
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To handle the general case suppose m = aP;'P5?---P,* is the prime 
decomposition of m in A. One can show that O,, is the ring compositum 
of the subrings O pe = = A[\pe«.]. We omit the details of this, but refer the 
reader to Proposition 17, Chapter 3 of Lang [5] where the analogous result 
is proven in the number field case. Since A[A pe. ] C A[Am] for all 1 <i <t 


we have O,, C Al[Am] C Om 


We are now in a position to give a fairly short proof of the following 
important result. 


Theorem 12.10. Let m € A have positive degree and let P € A bea 
monic, irreducible polynomial not dividing m. Then, the Artin automor- 
phism of the prime ideal PA in the extension K,,/k is the automorphism 
op which takes Am to Cp(Am). Let f be the smallest positive integer such 
that Pf =1 (mod m). Then, PO, is the product of ®(m)/f prime ideals 
each of degree f. In particular, PA splits completely iff P=1 (mod m). 


Proof. Since P does not divide m, PA is unramified in K,. Let $B be 
any prime ideal in O,, lying above PA. Then, the Artin automorphism is 
characterized by 


(PA, Km/k)w =w'lP! (mod P) Ww € On 


This is because the norm of the ideal PA is the number of elements in 
A/PA which is q¢8‘?) = |P}. 

As we have seen, the irreducible polynomial for Ap is Cp(x)/x. By the 
Corollary to Proposition 12.7, Cp(x)/x is an Eisenstein polynomial. Also, 
it is monic since we are assuming that P is monic. Thus, 


Cp(z) =2'?! (mod P) , 


and this congruence continues to hold modulo $ since P € $B. Conse- 
quently, 
op\m = Cp(Am) = NE! (mod $8) . 


Let w € O,. By Proposition 12.9, w = S\a;A1,,, where a; € Aand 0 <i< 
d(m). Thus, 


opw = bail opAm) = Darn = | (Daim) IPL =wIPl (mod $B) . 
(7) 


We have used the fact that |P| is a power of the characteristic p of k and 
Fermat’s little theorem for polynomials; i.e., al?! = a (mod P) for alla € A 
(see the corollary to Proposition 1.8). 

Equation 7 establishes the first part of the theorem, namely, (PA, 
Ky,/k) = op. For emphasis, we mention again that this equality is only 
true when P is a monic irreducible. 

The last part of the Theorem follows from the standard property of 
the Artin automorphism of a prime. Namely, if f is the order of the Artin 
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automorphism, then the prime splits into ®(m)/f primes of degree f. From 
the isomorphism Gal(K,,/k) = (A/mA)*, we see that the order of op is 
the smallest positive integer f such that P! = 1 (mod m). This completes 
the proof of the theorem. 


The last task in this chapter is to investigate the way in which the prime 
at infinity of k splits in the extension K,,. To do this we will need some 
preliminary work. First we need to know the degrees of the polynomial 
[m, 7] € A which are the coefficients of C,,(x). Secondly, we need a descrip- 
tion of the completion of k at oo. Finally, we will need an elementary, but 
powerful, technique of non-archimedean analysis, the Newton polygon. 


Proposition 12.11. Let Ciy,(x) = yr olm, ijx? where each coefficient 
[m,i] € A and d = deg(m). Then the degree of [m,i] as a polynomial in T 
is q'(d —1). 


Proof. If i > d, we set [m,i] = 0. Notice that [m,0] =m which has degree 
d = q°(d —0). This shows the result is true for i = 0. For the rest of the 
proof we assume 2 > 0. 

We first investigate the special case m = TIT” and proceed by induction 
on n. For n = 1 we have [T,0] = T of degree 1 = g°(1 — 0) and [7,1] = 1 
of degree 0 = q!(1 — 1). Thus, the result is true for n = 1. 

To go further, we first derive a recursion formula for the coefficients 
[T”, 2]. Consider the equation 


Cpn(x) = Cp(Cpn-1(x)) = TC pn-1 (x2) + Cpn-1(x)? . 
By isolating the coefficient of x? on both sides, we find 
[T”, 7) =T(T" 1,4 (Tt t-12. 


By induction, the degree of the first term on the right-hand side is 1 + 
q'(n — 1 —7) and the degree of the second term on the right-hand side is 
q'(n — 1). Since we are assuming 7 > 0, the second term has larger degree 
and it follows that [T”, i] has degree q*(n — 1). 

Finally, if m = ar aj;T) with each a; € F and ag £0, then 


from which one sees that deg([m,i]) = q*(d — 7%) since ag[T%, i] is the 
non-zero term of largest degree. 


We now turn our attention to the completion of k at infinity. It is useful 
to first give a discussion of the completion of k at the prime corresponding 
to the monic irreducible polynomial T, i.e., the completion of k at zero. 
Every element of h € k can be written as a power of 7’, T”, say, times a 
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quotient f(T)/g9(T) of polynomials, both not divisible by 7’. Under these 
circumstances, ordr(h) = n. We can give k a metric space structure by 
setting |hy — help = q7ordr (41-2), where g = #F. Two polynomials will 
be close in the resulting topology if their difference is divisible by a high 
power of 7’, i.e. if their initial coefficients coincide. Thus, a Cauchy sequence 
of polynomials will give rise to a uniquely defined power series in 7’. The 
completion of A in this topology, algebraically and topologically, is the ring 
of formal power series, F|/T]], where the topology is given by the powers 
of the maximal ideal (T). The completion of k is just the quotient field 
of F{[7]], which is called the field of formal Laurent series. We denote this 
field, F((T’)). A typical element has the form 


[ee 
3 a;I°*, where a,€F. 
i=—N 


To get a good description of the completion of & at infinity the trick is 
to replace T by 1/T in the above analysis. To see this, recall that for a 
polynomial f(T) in T we have, by definition, ord. f(T) = — deg f(T). Let 
d = deg f(T). We can write f(T) = T¢h(1/T) where h is a polynomial with 
non-zero constant term. If we set U = 1/T, then the monic irreducible U of 
the ring F{U] = F({1/T] defines a discrete, rank 1 valuation of k and clearly, 
ordy f(T) = ordyU~¢h(U) = —d. It follows that the two valuations ord, 
and ordy coincide on A and so they must coincide on k, i.e., they are equal. 
As a consequence of our previous discussion of the completion of k at zero 
we can now assert that the completion of k at infinity is the ring of formal 
Laurent series in U, i.e., 


Koo = F((U)) = F((1/T)) . 


The elements which are regular at infinity are the power series in 1/T, 
F((1/T]], and the units at infinity are the power series in 1/T’ with non-zero 
constant term. If 0 4 g € F((1/T)), then we can write g = (1/T)h, where 
h is a unit in F{[1/T]]. In this situation, ordaog = N. 

We shall return to k, shortly, but first we will describe the method of 
the Newton polygon. This method enables us to find information about 
the roots of a polynomial with coefficients in a field L, which is complete 
with respect to a discrete rank 1 valuation v. We denote by ord, the cor- 
responding ord function. Let 


d 
f(x) = S- aja! E L{z] , 
j=0 


be a polynomial and assume that agag # 0. Consider the set of points 


S, = {(, ord,a;) ER*|0<j<d, a; #0}. 
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Above each point in Sr erect a vertical ray and then take the convex hull of 
the resulting set. This convex hull is bounded on the sides by two vertical 
rays and below by a polygonal path connecting (0, ord,ao) with (d, ord,aq). 
This polygonal path is defined to be the Newton polygon of f, Nf. 

Let L be an algebraic closure of L. We continue to use the notation ord, 
for the unique extension of ord, to L. 


Theorem 12.12. Let L be a field which is complete with respect to a 
discrete, rank 1 valuation v. Let ord, be the corresponding ord function 
extended to L, an algebraic closure of L. Let f(x) = 0 a;x) € L{x} be 
a polynomial with agag #0. Let l be a line segment of the Newton polygon 
of f joining (j,ordya;) with (h,ordyap,) with 7 < h. Then f(x) has exactly 
h—j roots y in L such that ordyy is the negative of the slope of | . 


We shall postpone the proof of this theorem until the end of the chapter. 

To give some idea of its usefulness, let’s apply it to the case where f(z) 
is an Eisenstein polynomial, i.e., ord,ag = 0, ord,a; > 0 for 0 <7 < d, 
and ord,a9 = 1. One sees, without effort, that the Newton polynomial 
of f is just the line segment joining (0,1) with (d,0). It follows that f 
has d roots y such that ordyy = 1/d. Since an Eisenstein polynomial is 
irreducible, it follows that adjoining any root of f to L results in a totally 
ramified extension of degree d. Other applications of this nature are easy 


to produce, but we leave these aside and proceed to apply the method to 
the case f(x) = Cm(x) € kao [a]. 


Proposition 12.13. Let C,,(x) € k[z] C koo|ax] be the Carlitz polynomial 
corresponding to m € F[T] of degree d. Let kg be an algebraic closure of 
kx. and continue to denote by ordg, the unique extension of ordo, to Koo: 
Then, for each 1 <i <d, there exist exactly q’ — q*=1 roots X of Cm(zx) 
such that 1 
ordgA =d—-i- ——. 
q-—1 

Proof. Recall that Cy(x) = ae [m,i]2?. By Proposition 12.11, 
ord.o(m, 1] = — deg [m, i] = —q'(d —1). 

To apply Theorem 12.12, we first divide C,,(x) by x to get a polynomial 
with non-zero constant term. The points to consider in the construction of 
the Newton polygon of C,,(x)/ax are 


{(¢’ —1,-q’(d—39)) |O< 7 Sd}. 


The lines connecting successive points all have different (and increasing) 
slopes, so the Newton polygon of C,,(x)/x consists of just these line seg- 
ments. Connecting the 7 — 1’st point with the i-th point gives the slope: 


wa (d=) (ed dt +) gy yp 
q —4q 


q-1- 
The proposition now follows from Theorem 12.12. 
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Corollary. There are q —1 roots dN of Cm(x) in kg such that ordes N= 
d—1-—1/(q—1). For each such root, we have M1 € kg 


Proof. The first assertion is a special case of the proposition corresponding 
toz=1. 

The monomials which occur in C,,(x)/x with non-zero coefficients all 
have the form 2? ~!. Thus, C,,(z)/x = f(w), where w = x7~ 1 and f(w) € 
k[w]. The roots of f(w) in k,. are {A7~'} where \ runs through the roots 
of Cm(z). The map \ > AI7! is q— 1 to 1, since whenever d is a root of 
Ci, (x) so is ad for any a € F*. It follows that f(w) has exactly one root 
with orday = (¢q—1)(d—-1)—1. 

Let o be any automorphism of kg, leaving koo fixed. Then, cy is also a 
root of f(w). Since orday = ordwoay, we must have oy = ¥. Since oa is 
arbitrary, it follows that 7 € koo, as asserted. 


Using the Carlitz action, k,. can be made into an A-module in exactly 
the same way that we made k into an A-module. Namely, if a € A and 
u € ko then we define a: u = C,(u). If m € A is of positive degree, we 
denote the m-torsion points, kog[m], by Am 

Let e denote a fixed field isomorphism o over k from Ky, to koo. Since 
K,,/k is a Galois extension, all field isomorphisms over k from K,, to kiso 
are of the form 100 with o € Gal(Kp/k). 

The isomorphism ¢ corresponds to a prime 8. of K,, lying over oo. To 
see this, let O. = {w € Km | ordw > 0}. It is easy to see that Og is a 
discrete valuation ring inside K,, which contains F and has quotient field 
Ky». By definition this is a prime of K,, which we denote by Boo, its max- 
imal ideal. The proof of the fact that $B, lies above oo is straightforward. 

Suppose that A is a root of Cy,(x) in k. Since C,(A) = 0 implies 
C,,(L\) = 0, we see that « maps A,, to Am. This map is an A-module 
isomorphism. By Proposition 12.13, there is an element Am € Am such 
that ordgAm = d—1-—1/(q—1). Let Am € Am be such that vAm = Am 


Theorem 12.14. Let J = {og € Gal(Km/k) | a € F*} and set Ky, equal 
to the fixed field of J. Then o0 splits completely in K* and every prime 
above oo in K;* is totally and tamely ramified in Km. 


Proof. The proof will proceed in steps. 


Step 1. The first thing to do is to show that \m is an A generator of 
Am. Suppose a € A — {0} of degree less than d. Then, 


dega 
~ ~ 4 ~ 1 

ordgoCa(Am) = Ordoo (S- [a, 8 = ordaadAm = d—dega—1— 71° 

i=0 

- (8) 

To justify the second equality, one has to show that the term a.,, in the 
sum is the one with lowest ord. We leave this straightforward calculation 
to the reader. 
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If Am were not an A-generator of A,, there would be a proper divisor m1 
of m such that Cym,(Am) = 0. Equation 8 shows this is impossible. ‘Thus, 
\m is an A-generator of A,,, as asserted. Moreover, since A,, and A,, are 
A-isomorphic via v, it follows that A,, is an A-generator of Aj. It follows 
that koo(Am) = Koo(Am) and Km = k(Am) = k(\m): 

Step 2. We show that KK = k(\%71). Let og € J. Then, oa(Ad7*) = 
(TaAm)? + = (AAm)?~! = AT1. Since a € F* is arbitrary, it follows that 
AT! © Kj. Since Am is a root of r?-! — \97! € K7,[x] we have [Kim : 
k(A%~")] < q—1. By Galois theory, [K,, : K+] = q —1. It follows that 
Kk = k(\971), as claimed. 

Step 3. From Step 2, we see that u(Kw) = u(k(A97!) C ko (AST "). By the 
corollary to Proposition 12.13, we have \2>! € kyo. Thus, (K+) C koo. It 
follows that co splits completely in K>, which proves the first part of the 
theorem. 


Step 4. We claim that the extension k 90 (A m)/ Keo is totally and tamely 
ramified of degree q-- 1. Let y = A271. Then, Am satisfies 297! —y € koo{a]. 
Thus, [koo(Am) : Koo] < q—1. On the other hand, ordgoAm = d—1—1/(q—1) 
SO the ramification index of the extension is at least g — 1. It follows that 
the degree of the extension is g — 1 and the ramification index is q — 1, 
which is what was to be proven. The ramification is tame, since q — 1 is 
prime to the residue field characteristic which is p. 


Step 5. Let $B, be the prime of K,, discussed earlier and p,, be the prime 
of K; lying below it. The completion of K+ at poo is kop by Step 3. The 
completion of Km, at Boo is ko (Am): Thus, by Step 4, Boo/Poo is totally 
and tamely ramified of ramification degree g —1. The other primes over oo 
behave the same way since K,,/k is a Galois extension. 


Corollary. For all m € A, m # 0, the constant field of Ky», is F, 
Ky,/k is a geometric extension. 


Proof. Since f(p./oo) = 1, the residue class field at p. is F. We have 
f(Boo/Poo) = 1 since Boo/po. is totally ramified. Thus the residue class 
field of $B. is also F. Since the constant field of K,, injects into the residue 
class field of Boo, the result follows. 


Since the properties of K;>, are so similar to those of Q*, we call K;* the 
maximal real subfield of K,,. The point is that the prime at infinity of k 
splits completely in K;{ and every prime above it in K* ramifies totally 
in Km. This is just the behavior of the prime at infinity of Q, the only 
archimedean prime. It splits completely in Q*, and every prime above it 
ramifies totally in Q,. Also notice that the Galois group of K,/K;, is 
isomorphic to F*, the non-zero units of A, whereas the Galois group of 
Qm/Q*, is isomorphic to {+1} the non-zero units of Z. 

In general, we will call a finite extension K of k real if oo splits completely 
in K. For example, the theory of quadratic function fields (quadratic ex- 
tensions of k) is divided up into the theory of real quadratic function fields, 
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the case where oo splits, and complex quadratic function fields, the case 
where oo is either inert or ramifies. We will discuss this in greater detail 
later. 


Having described in some detail the cyclotomic function fields, K,, = 
k(Am), we will give a sketch of the result of Hayes which is the function 
field analogue the Kronecker-Weber theorem. The latter theorem states 
that every finite abelian extension of the rational numbers Q is contained 
in a cyclotomic field Q(¢,) for some positive integer n. For a discussion 
of the history of this theorem and a proof see Chapter 14 of Washington 
[1]. It cannot be true that every abelian extension of k = F(T) is con- 
tained in some field K,, = k(Am) because, among other reasons, the above 
Corollary shows that K,,/k is a geometric extension. Thus, it cannot con- 
tain a constant field extension of k (recall that all finite extensions of a 
finite field have a cyclic Galois group). Let’s work within a fixed algebraic 
closure of k. Define k(A) to be the union of all the fields K,,. Secondly, 
let k = Fk be the maximal constant field extension of k. These fields are 
abelian and disjoint and one might think that every abelian extension of k 
is a subfield of the compositum of k(A) and k. However, this field is still 
not big enough since a moment’s reflection shows that a subfield of this 
compositum must be tamely ramified at oo. To construct abelian exten- 
sions of k that are wildly ramified at infinity, work with the parameter at 
infinity, i.e., 1/7’, rather than T. One considers the ideal (1/7) in the ring 
F(1/T] and using the Carlitz construction for this situation produces the 
fields k(Ap—n-1). These fields are abelian over k, totally ramified at oo, 
and [k(Ap-n-1) : k] = q"(q—1). Let Ly, be the unique subfield such that 
[L, : k] = q” and set Log equal to the union of all the fields, L,,. These 
three fields, k(A), k, and Loo, are disjoint and the main theorem states 
that every abelian extension of k is contained in their compositum. In this 
sense, the Carlitz module gives an explicit construction of the maximal 
abelian extension of k. Hayes’ proof relies heavily on class field theory. In 
the case of the Kronecker-Weber theorem it is possible to produce more 
elementary proofs so it is certainly possible that a more elementary proof 
can also be given in the function field case. We leave this as a challenge to 
the interested reader. 


To conclude this chapter, we sketch the proof of Theorem 12.12, the 
Newton polygon method. Our sketch will include enough details so that 
giving a complete proof will only involve setting up a formal induction step 
rather than the informal one given below. 


Proof of Theorem 12.12. (Sketch) 


We begin by noticing that we can assume that ag = 1. This is because 
the Newton polygon of an a,/ao x) is the same as that of f(x) except 
that it is shifted vertically by —ord,ao. The roots remain the same and the 
length of the line segments and the slopes remain the same, so we may as 
well assume that ap = 1. 
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Secondly, since zero is not a root, we may as well work with the inverse 
roots rather than the roots. We will thus connect the slopes of the line 
segments with the ord,s of the inverse roots (no negative sign). Thus, write 


d 
f(«) =| [Q-7m2), 
i=1 


and arrange the inverse roots such that 


ord, 7 =-+++ =ordyty, = $1 < ordytr,41 
=-+) = ordytr, yr. = 82 < OrdyTr 47541 


=" Ordy Tr, +ro+r3 = $83< Ordy Mr, 4+rg+r3+1 = etc. 


Having ordered the inverse roots in this fashion, we claim that the ver- 
tices of the Newton polygon are 


Po = (0,0), Pi = (71,7181), Po = (71 +17 2,7151 + 7282), 
Py = (ry +7ro2+73,7181 +7282 +1383), 


etc. Assuming for the moment that this is the case, we see that the theorem 
is established since the difference of the x-coordinates of P;_, and P, is r; 
and the slope of the line connecting them is s;. By the way, we have grouped 
the inverse roots, we see there are precisely r; of them such that ordy7 = s;. 

To prove our assertion about the vertices, notice that the j-th coefficient 
of f(x), namely a,, is the j-th elementary symmetric function of the inverse 
roots 7;. If0 < 7 < r;, then from the form of the j-th elementary symmetric 
function we see that ord,a; > js,. It follows that the slopes of the lines 
connecting (j, ord,a,;) to (0,0) are all greater than or equal to s; for 7 in 
this range. However, we must have ord,a,, = 118,, since only one term 
in the r,-st symmetric function has this order, namely, 7172---7,,, all the 
other terms having greater order. By exactly the same reasoning we see that 
for h in the range 0 < h < rq we have ord,a,y, 4, > 7181 + hse, whereas 
ordyQr,4r, = 7181 +7282 (the term of the ry +7r9-th elementary symmetric 
function having the smallest order being 71 -+-+7,,7r,41°''Tr,+4r,). Thus, 
the lines connecting these points to (r,, ord,a,, all have slopes greater than 
or equal to s2, whereas the line connecting (71,7151) to (r1 +12, 7181 +1282) 
has slope exactly s2. In general, let h vary in the range 0 < h < r; and 
consider the point with index r = ar rm + h. Looking at the r-th 
elementary symmetric function of the inverse roots we see 

i-1 
ord, ar > S- Tm8Sm + hs; . 


m=0 


Thus, the slopes connecting these points to P,_; are greater than or equal 
to s;, whereas the slope connecting P; to P;_, is exactly s;. Since the slopes 
$s; are monotone increasing, this is sufficient information to conclude the 
proof. 


216 


Michael Rosen 


Exercises 


Throughout the exercises, F will denote a finite field with q elements, A = 
F(T], the polynomial ring over F, and k = F(T) the quotient field of A. If 
m € A we set |m| = q%&™. If m € A, Am will denote the m-torsion points 
on the Carlitz module. 


1. 


Let P € A be a monic irreducible polynomial and A a generator of 
Ap. Show that P = [Taeg a<deg P Cu(A), where the product is over all 
non-zero polynomials of degree less than deg P. 


. Let M denote all monic polynomials of degree less than deg P. Set 


m = [Jem Ca(A). Use Exercise 1 to show that P = (—1)%8P x97! 
Set P* = (—1)98P P. Then P* = 797}. 


. (Continuation) Let Q 4 P be another monic irreducible polynomial. 


Recall the symbol (a/Q), which is defined to be the unique element 


of F such that 
Q\-1 


qt = (5) (mod Q) . 


Use the fact that og is the Artin automorphism at @ in the field 
Kp = k(Ap) to prove that og(m) = (P*/Q)r. 


. (Continuation) Use Theorem 12.10 to show og(7) = [Jgeny Caa(A). 


Now, use Gauss’ criterion (see Exercise 10 of Chapter 3) to show that 


oq(m) = (Q/P)r. 


. (Continuation) Combine Exercises 3 and 4 to prove the reciprocity 


law; i.e., if P # Q are two monic irreducibles, then 


(8) (8) -eorrme 


This nice proof is due to Carlitz. 


. Let P be a monic irreducible of degree d. Use the Riemann-Hurwitz 


formula to prove that the genus of Kp = k(Ap) is (d—1)q?7+1—- 
2d—(l+q+---+q71). 


. Let e > 0 be an integer. Compute the genus of Kpe. (When e > 2 


the calculation is more difficult because the extension is not tamely 
ramified. One needs to compute the exponent of the different at the 
prime above P by local considerations. See Hayes [1].) 


. We continue to assume that P is a monic irreducible of degree d. 


Let e > 1 and let be an A-generator of Ape. Let (A)o. be the 
polar divisor of X. Show deg (A)o = q?71 if e = 1 and deg (A) = 
g’&-D4-1(g4 — 1) ife > 2. 
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9. Let m € A be a monic polynomial, A a non-zero element of Am, 


10. 


and o an element of the Galois group of K,,/k. Prove that aA/d 
is a unit in O,,. (These units are called cyclotomic units since they 
are analogous to cyclotomic units in Q(¢,,). We will encounter them 
again in Chapter 16.) 


Let m € A be a monic polynomial. Define Qo to be the index [O* : 
O-**|. Show that Qo = 1 if m is a prime power and that Qo = 
q —1 if m is not a prime power. Hint: Try imitating the proof of the 
corresponding fact in cyclotomic number fields. 


13 
Drinfeld Modules: An Introduction 


In the last chapter we introduced a special class of Drinfeld modules for the 
ring A = F(T] defined over the field & = F(T) and discussed some of their 
properties. By considering the Carlitz module, in particular, we were able 
to construct a family of field extensions of k with properties remarkably 
similar to those of cyclotomic fields. In this chapter we will give a more 
general definition of a Drinfeld module. The definition and theory of these 
modules was given by V. Drinfeld in the mid-seventies, see Drinfeld [1, 
2). The application of the rank 1 theory to the class field theory of global 
function fields is due to Drinfeld and independently to D. Hayes [2]. The 
article by Hayes [6] provides a compact introduction to this material. A 
comprehensive treatment of Drinfeld modules (and, even more generally, 
T-modules) can be found in the treatise of Goss [4]. 

In this chapter we will develop the beginnings of the general theory, but 
will not pursue it further. Our aim is to supply the reader with some of 
the basic ideas and, hopefully, the stimulus to pursue the study of these 
modules further. Many beautiful and deep applications have already been 
discovered. However, the subject remains young and is under active devel- 
opment. 


Let k/F be a function field with exact field of constants F, a finite field 
with q elements. Let oo be a fixed prime of k and let A C k be the ring 
of all elements of k whose only poles are at oo. It is well known that 
A is a Dedekind domain whose non-zero prime ideals are in one-to-one 
correspondence with the primes of k different from oo. If (0, M) is a prime 
of k such that A C O, the corresponding prime ideal of A is MMA. On 
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the other hand, if P C A is a non-zero prime ideal of A, then (Ap, PAp) is 
the corresponding prime of k. It is clear that the polynomial ring A = F(T] 
considered in the last chapter is a special case of this construction. In this 
special case, the degree of the prime at infinity is 1, but that might not be 
so in the general case. We set d, to be the degree of the prime at infinity. 

Let L be a field containing F and G,/,;, the additive group scheme over 
L. If Q is an L-algebra, Ga/z assigns to its additive group Q4, ie., 2 
considered solely as an additive group under addition. In homological al- 
gebra this is sometimes referred to as a “forgetful functor.” The additive 
group scheme over L assigns to every L-algebra its structure as an additive 
group, forgetting about the multiplicative structure. The endomorphism 
ring of Gaz, over F, Endp(Ga/z), assigns to every L-algebra Q the alge- 
braic endomorphisms of 24 which commute with the action of F. Using 
Proposition 12.2, one can show that Endg(Ga/,) = L < 7 > where 7 
is the map which raises an element to the q-th power. So, if wu € Q and 
\-ca7' € L <7 >, then 


(S- c;T')(u) = So cut ; 


The right-hand side of this equation is an additive polynomial with coeffi- 
cients in L. The endomorphism ring of G,,;; over F can thus be considered 
either as the non-commutative polynomial ring L < 7 > with the key 
relation Ta = a!7, or as the ring of additive polynomials over L with mul- 
tiplication being given by composition. Both descriptions are useful and, 
after a little experience, no confusion is likely to result from employing 
them both. 

The map D: L < 7 > L given by D(3~*c;7*) = cp is a homomorphism. 
It will play a role in the definition of a Drinfeld module, which we are 
about to give. In the alternate world of additive polynomials, D applied to 
Sox? is just differentiation with respect to z. 


Definition. A Drinfeld A-module over LZ consists of an F-algebra ho- 
momorphism 6 from A to L, together with an F-algebra homomorphism 
p:A-—-+L< 7 > such that for all a € A, D(pa) = 6(a). Moreover, we 
require that the image of p not be contained in L. The notation Drin,(L) 
will denote the set of all Drinfeld A-modules over L, the structural map 6 
being assumed fixed. 


In practice the map 6 is often just containment in a field L, but it is also 
occurs as reduction modulo a prime ideal. In the last chapter, it was just 
the containment of A in k. 

A simple but useful, remark is that pg, = a7, the identity of L < 7 >, 
for all a € F. This is because p is an F-algebra homomorphism taking 1 to 
T°. Thus, Pg = ap; = ar® for alla € F. 

If Q is an L-algebra, then 6 makes Q into an A-module in the obvious 
way, namely, a-u = 6(a)u. The idea of a Drinfeld module is that it makes 
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Q into an A-module in a new way which is a “deformation” of the standard 
one. Namely, if we define a x u = pq(u), it is straightforward to check that 
this makes 2 into an A-module. This is a deformation of the standard 
action since a: u and a * wu both have the same linear term, 6(a)u. It is a 
new action since from the definition of a Drinfeld module, a-u # a *u for 
at least one a € A. When considering 2 as an A-module under the action 
of p we shall use the notation Q,. In this chapter, the only algebras we will 
consider are field extensions of L. 

Let p € Drin,(L) and M/L be an algebraically closed field extension. 
Consider the A-module, M,. If 0 # a € A, we want to investigate the 
structure of the torsion submodule M,|a] = {u € M, | pa(u) = 0}. More 
generally, if (0) 4 I Cc A is an ideal in A, we want to investigate the 
structure of M,[I] = {u € M | pa(u) = 0,Va € I}. In the course of doing 
this we will have to define and explain the notions of the rank and height 
of a Drinfeld module. 

We begin by defining another notion, the A-characteristic of DL, con- 
sidered as an A-module via 6. If 6 is one to one, we say that LD has A- 
characteristic 0. If 6 is not one to one, its kernel is a non-zero prime ideal 
Q of A. In this case we call Q the A-characteristic of L. This notion is not 
to be confused with the characteristic of L as a Z module. For all fields un- 
der consideration in this chapter, this characteristic is p, a non-zero prime 
number in Z. The A-characteristic of L is a completely different notion. 

A reader familiar with the arithmetic of elliptic curves or, more generally, 
abelian varieties, will find the form of the following result and its corollary 
quite familiar. 


Theorem 13.1. Let p € Drin,(L) as above. Let Q be the A-characteristic 
of L. Finally, let M be an algebraically closed field containing L. If PAQ 
1s a non-zero prime ideal of A, ande > 1 an integer, then there 1s a positive 
integer r independent of P and e such that 


M,|P*] = (A/P*) . 


If Q 4 (0) and e > 1 is an integer, there is another integer h, independent 
of e, such that 


Mp 1Q?] = (A/Q°*V°- 


Corollary. Let I Cc A be an ideal relatively prime to the A-characteristic 
of L. Let p € Drin,(L) and let M be an algebraically closed field containing 
L. Then there is an integer r, independent of I, such that 


M,[I] & (A/D™ . 


We will not prove these results now, but we will do so after developing 
some preliminary machinery. The integer r will turn out to be the rank 
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of p and the integer A will turn out to be the height of p, concepts whose 
definition will be given shortly. 

The next few lemmas will be very general results about torsion modules 
over a Dedekind domain. Since these facts are fairly standard, we will only 
sketch the proofs, assuming that the reader either already knows these 
results or can fill in the details without difficulty. In these lemmas, A will 
be a Dedekind domain and M a module over A. If J C A is a non-zero 


ideal, then M[I] = {me M | ax = 0,Vae I}. 


Lemma 13.2. Suppose J and H are non-zero ideals of A which are rela- 


tively prime; t.e., J+ H =A. Then M[JH] = M[J| 6 M(H]. 


Proof. By hypothesis, there exist a € J and b € H such that a+b = 1. For 
m € M|JH]| we have m = am+ bm. Since am € M[H] and bM € M{J], 
we have shown M[JH]| = M|J| + M|H]. To prove that the sum is a direct 
sum, let m € M[I]N M[J]. Then m = am + bm = 0+0 = 0, and we’re 
done. 


Corollary. Let I = Pj! P;?.-- Prt be the prime decomposition of the ideal 
I # (0). Then, 


MI] = M[Py"] ® M[P3*] @ +» M[Pe"] . 


Proof. This follows from the lemma by a simple induction on t. 


If P is a maximal ideal of A, let’s define M(P) = US2, M[P°]. This is 
called the P-primary component of M. 


Lemma 13.3. Let M be a torsion A-module. Then 


M=QM(P), 
P 


where the sum is over all maximal ideals of A. 


Proof. Let m € M. Since M is a torsion module, there is a non-zero a € A 
such that am = 0. Consider the prime decomposition of the principal ideal 
(a) and apply Corollary 1 to Lemma 13.2. This shows that M = >>, M(P). 

To show the sum is direct, suppose 0 = 7;_, mi, where m; € M(P,). 
For each 7 with 1 <7 < t, there is an e; > 0 such that m; € M[P*]. Now 
apply the corollary to Lemma 13.2. 


Lemma 13.4. If (0) ~ M,; — Mz > M3 — (0) is an exact sequence of 
torsion A-modules and P C A is a maximal ideal, then (0) + Mi(P) > 
Mo2(P) > M3(P) > (0) is also exact. 


Proof. This follows easily from Lemma 13.3. 


13. Drinfeld Modules: An Introduction 223 


Lemma 13.5. Let P be a mazimal ideal of A and select r € P—P?. Then 
M[P*| = M[r°|(P) . 


Proof. We have (7°) = P®J where P and J are relatively prime. By 
Lemma 13.2, 
M|r°| = M[P*])6 MJ] . 


Taking the P-primary component of both sides gives the result, since 


M[J\(P) = (0). 


Lemma 13.6. Suppose M is a divisible A-module, P C A a maximal ideal, 
and e>1 and integer. The following sequence is exact: 


(0) + M[P] — M[P*] > M[P**] = (0) , 
where the third arrow is given by multiplication by 7 € P — P?. 


Proof. Choose 7 € P — P?. Using the divisibility of M, we see that the 
following sequence is exact: 


(0) + M[x] > M[n®] — M[r®") — (0). 


The result follows by taking P-primary components and using Lemmas 
13.4 and 13.5. 


Corollary. Suppose that M[P] is finite. Then M|P°] is finite for alle > 0 
and #M([P*°| = #M|P|°. 


Proof. This is immediate from the lemma and a simple induction. 


We will now give a result which provides the basis for the definition of 
the rank of a Drinfeld module. 


Proposition 13.7. Let p be an A-Drinfeld module defined over a field L. 
Define u(a) = —deg, pa for alla € A. Then there is a positive rational 
number r such that (a) =r ordg(a)dg for alla € A. 


Proof. If we define (0) = 00 we easily check that u gives a map from A 
to ZU oo such that u(a) = co if and only if a=0, p(ab) = u(a) + Ld), 
and u(a +b) > min(u(a), u(b)). Moreover, (a) < 0 for all a € A and 
u(a) < 0 for some a € A. These properties show that 4 can be extended 
to an additive valuation on the quotient field k of A. It cannot be one of 
the valuations associated to maximal ideals of A since all these have non- 
negative ord on A. Thus, must be equivalent to the valuation at infinity. 
This shows there is a real number r such that pu(a) = 7 ordgg(a)doo. Let 
a € A be such that deg,(p,) > 0 (the existence of such an a € A is specified 
in the definition of Drinfeld module). Then, (a) is a negative integer. We 
claim ord.,(a) is also a negative integer. To see this, note that the remark 
immediately following the definition of a Drinfeld module shows that a is 
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not a constant. Since a has no pole at primes corresponding to maximal 
ideals in A, it must have a pole at infinity (see Proposition 5.1). It follows 
that r is a positive rational number, as asserted. 


Definition. The rank of a Drinfeld A-module p is defined to be the unique 


positive, rational number r , such that deg, pg = —r ord (a@)dog for all 
aca, 

We will soon see that the rank of a Drinfeld module is actually a positive 
integer. 


It is illuminating to reformulate the definition somewhat. For a € A 
define deg a to be the dimension over F of A/aA. This is clearly a gener- 
alization of the degree of a polynomial in a polynomial ring over a field. 
(Caution! This is not the degree of the principal divisor (a) which we know 
is zero.) As we will see from the following proof, deg a coincides with the 
degree of the zero divisor of a. 


Proposition 13.8. For alla € A we have dega = —ord.(a)d. Thus, 
the rank of a Drinfeld A-module p, can also be defined to be the unique 
positive, rational number r such that deg,(pa) =r dega for allae€ A. 


Proof. Let mp = ordp(a) and aA = [|p P™? be the prime decomposition 
of the principal ideal aA. By the Chinese Remainder Theorem, we have 


A/aA= €)A/P™? . 
P 


For any maximal ideal P C A, we have A/P™ & Ap/(PAp)™. Since PAp 
is a principal ideal, (PA,)'~'/(PAp)' = Ap/PAp for all i > 1. Thus, 


dimp(A/P™) = m dimp(Ap/PAp) =m degP. 


We conclude that dega = )/p,,. ordp(a) deg P. Since the degree of a 
principal divisor is zero (Proposition 5.1), it follows that 


dega = S- ord p(a) deg P = —ordgo(a)doo , 
Poo 


as asserted. 


The next topic to consider is the height of a Drinfeld module. This is 
of interest only for Drinfeld A-modules p of non-zero characteristic. Recall 
that the A-characteristic of p is the kernel of the structural map 6: A > L, 
where L is the field of definition of p. Call this ideal Q and suppose that 
Q # (0). For a € A, let w(a) be the index of the smallest power of 7 in pg 
with non-zero coefficient. In other words, if pg = S50 Cm7T™ and c; = 0 
for i <n but c, £0, then w(a) = n. Define w(0) = oo. 


Proposition 13.9. There is a positive rational number h with the property 
that for alla € A we have w(a) = h ordg(a) deg Q. 


13. Drinfeld Modules: An Introduction 225 


Proof. The map w takes A to ZU oo and has the following properties: 
W(a) = oo if and only if a = 0, w(ab) = w(a) + (db), and w(a+b) > 
min(w(a),w(b)). Moreover, w(a) > 0 for all a € A, and w(a) > 0 if and only 
if a € Q. It follows easily from these facts that w extends to an additive 
valuation of k which is equivalent to ordg(*). Thus, there is a real number 
h having the required property. To show that h is positive and rational, 
just choose a € Q with a # 0. Then w(a) is a positive integer and so is 
ordg(a). Thus, fh is a positive, rational number. 


Definition. If p is an A-Drinfeld module with A-characteristic zero, de- 
fine the height of p to be zero. If p has A-characteristic Q # (0), de- 
fine the height to be the unique positive, rational number h such that 
w(a) = h ordg(a) deg Q for alla € A. 


We are now in a position to prove Theorem 13.1 and its corollary. In the 
course of the proof we will show that both the rank and the height of p are 
integers. 


Proof of Theorem 13.1. We recall that M is an algebraically closed field 
containing L and that p € Drin,(L). Let P C A be a prime ideal different 
from the A-characteristic Q of L. Choose b € P with b #0. The A-module 
M,|P] is finite since M,[P] c M,[b] and the latter set is finite being the 
set of zeros of pp(u). If follows that M,[P] is a finite dimensional vector 
space over A/P of dimension d, say. Thus, 


#:M,[P] _ ,ddeg P 


The class group of A is finite. We borrow this result from Chapter 14, 
(see Corollary 2 to Proposition 14.1 and take the set S in that proposition 
to be S = {oo}). It follows that there is a positive integer m such that 
p™ = aA, a principal ideal. Thus, #M,|P™] = #M[a]. We compute the 
size of both sides. By the corollary to Lemma 13.6, we have 


#M,[P™] _ #-M([P]™ _ gird deg P 


Ifa € Q, the A-characteristic of p, then P™ C Q, and it would follow 
that P = Q, contrary to assumption. Thus, a ¢ Q and p,(u) is a separable 
polynomial (its derivative with respect to u is 6(a)). Thus, 


#:M,|a] _ gi°8r Pa — g” 1°84 


where the last equality follows from Proposition 13.8. Here, r is the rank 
of p. 
Since P” = aA, we have dega = mdeg P. Thus, 


mddeg P=r dega=rmdegP. 


Cancelling mdeg P from both sides, we conclude r = d. This shows that 
the rank is an integer and proves the first assertion of Theorem 13.1 in 
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the case where e = 1. The general case follows from what we have already 
proven, Lemma 13.6, and the structure theorems of finitely generated, tor- 
sion modules over Dedekind domains. We leave the details to the reader. 
(See Exercise 1 at the end of the chapter.) 

Suppose now that the A-characteristic @ is not the zero ideal. Again, 
using the finiteness of the ideal class group we see there is a positive integer 
n such that Q” = bA, a principal ideal. It follows that #M,[Q"|] = #M,|b]. 

We first count the number of elements in M,/b]. Since the polynomial 
p(w) is inseparable in this case, we cannot simply use its degree. However, 
it is clear that by factoring out 7“) on the right, we can write pp = pir”), 
where p},(u) is separable and deg, p;, = deg, pp — w(b). Since 7 is a one to 
one and onto map from M — M, we conclude 


#M(b| _ gie8 Po—w(b) __ gr dee o—h ordg(b) deg Q 


The last equality follows from Propositions 13.8 and 13.9. 
On the other hand, 


#M,[Q"] = #M,[Q]" = a"? 22 , 


where d’ is the dimension of M,|Q] considered as a vector space over A/Q. 
Using the last two equations together with the facts, deg b = ndeg Q and 
ordg(b) =n, we find 


nd’ deg Q = rndeg Q — hndegQ . 


Cancelling n deg @ from both sides, we conclude d’ = r — h. This proves 
h is an integer and simultaneously proves the second assertion of Theorem 
13.1 in the case when e = 1. As before, the general case follows without 
much difficulty . 


Proof of the Corollary to Theorem 13.1. Since J is prime to Q, Q 
does not occur in the prime decomposition of [ = Pj! P;?... Prt. By the 
Theorem, we have for each 7 with 1 <i < t, 


M[P] & (A/P%) | 


Sum both sides from 1 to t. The result follows from the Corollary to Lemma 
13.2 together with the Chinese Remainder Theorem. 


Having defined Drinfeld modules and discussed their torsion points and 
the notions of rank and height, we now proceed to define maps between 
Drinfeld modules, i.e., we want to study the category of such objects. 


Definition. Let p, p’ € Drin,(L). A morphism from p to p’ is an element 
f of L <7 > with the property that fo, = p!,f for all a € A. The set of 
all such morphisms is denoted by Homz(p, p’). 

Under the addition in L < 7 > it is easy to see that Homz(p, p’) is 
an abelian group. Also, the product in L < 7 > gives a bi-additive map 
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(f,9) - gf from Hom,z(p, p’) x Homz(p’, p”’) to Homz(p, p’”). In particular, 
these two operations make Homz(p, p) = Endz(p) into a ring. 

Let 2 be an L-algebra and p,p’ € Drin,(L). Let f ¢ Homz(p, p’). Then, 
u— f(u) is a homomorphism of 0 > Q as abelian groups and, as is easily 
checked from the definition, an A-module homomorphism from 2, — Q,.. 

If, as is often the case, we take 2 = M, an algebraically closed field 
extension of L, we see that u > f(u) is an onto map from M, > M,, 
with finite kernel (the zeros of f(w)). For this reason we may refer to non- 
zero elements of Homz(p,p’) as isogenies. Also, we say that p and p’ are 
isogenous over L if Homz(p, p’) ¥ (0). 


Proposition 13.10. [fp and p’ are isogenous Drinfeld modules, then they 
have the same rank and height. 


Proof. Let 0 4 f € Homz(p, p’) and choose a non-constant element a € A. 
Then, fa = pf. Taking the degree with respect to 7 of both sides shows 
that deg, a = deg, p’,. Thus, rdeg(a) = r’ deg(a), where r and r’ are the 
ranks of p and p’, respectively. This shows r =r’. 

That the heights of p and p’ are equal follows from similar reasoning. 


The identity of Homz (p, p) is clearly r° for all Drinfeld modules. What is 
an isomorphism? By definition, f € Homz(p, p’) is an isomorphism if and 
only if there is a g © Homz(p’, p) such that fg = 7° = gf. In the twisted 
polynomial ring L < 7 > this can only happen if f = er° and g = c7!1° 
for some non-zero element c € L. Thus, p and p’ are isomorphic if and only 
if there is ac € L* such that cp, = pic for alla € A. 

Suppose J Cc A is a non-zero ideal and that p € Drin,(L). We are 
going to construct a new Drinfeld module p’ and a non-zero isogeny py € 
Hom,(p, p’). This construction plays an important role, especially in the 
arithmetic applications of the theory of rank 1 Drinfeld modules. 


Lemma 13.11. The ring L < 7 > has a division algorithm on the right. 
More precisely, if f,g € L <1 > andg #0, there evist s,r € LD <T> 
such that f = sg+r withr =0 or deg,r < deg, g. 


Proof. The proof is just about the same as in the case of a commutative 
polynomial ring. For details see Goss [4], Chapter 1. 


Corollary. Every left ideal in L <7 > is principal. 


Proof. If J Cc L < 7 >is a non-zero left ideal, let g € J be an element of J 
with smallest degree in 7. If f € J, then by the theorem we have f = sg+r 
with either r = 0 or deg, r < deg, g. The latter alternative is impossible by 
the definition of g and the fact that r € J. Thus, r = 0, so every element 
of J is a left multiple of g. 
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Definition. Let J Cc A be an ideal and J the left ideal in LD < 7 > generated 
by the set {py | b € I}. Define p; to be the unique monic generator of the 
left ideal J. 


Proposition 13.12. Let M be an algebraically closed field containing L. 
Then, 
My[I] = {4 €M | pr(d) =O}. 


Proof. Suppose 4 is a root of p;(x). If b € J, there is an fp Ee L<T > 
such that f,o; = py. Thus, 0 = f,(0) = fo(pr(A)) = po(A). It follows that 
the roots of p; are contained in M,[J]. 

On the other hand, there exist b; € J and f; ¢ L<7>,withl1<j <t, 
such that 


t 
PI = 3 Fi; Pb, ° 
j=1 


From this it follows easily that every element of M,[J] is a root of p7(z). 


Proposition 13.13. Let p € Drin,(A) and (0) #1 CA be an ideal. Then 
there is a uniquely determined Drinfeld module I * p € Drin,(A) such that 
pr 1s an isogeny from p to I * p. 


Proof. The left ideal J c L < r > generated by the set, {~ | b € I}, is 
mapped into itself under right multiplication by pg for any a € A. Thus, for 
alla € A, there isa p/, € L <7 > such that prpa = pip. A straightforward 
calculation shows that the map a — p’, is an F-algebra homomorphism from 
A>~L<rT>. 

Define 6’(a) = D(pi,). This is easily seen to be an F-algebra homo- 
morphism from A — L. If we knew that 6’ = 6 it would follow that 
p’ € Drin, (A) and by setting I* p = p’, the proof would be concluded. As a 
matter of fact, 6’ is equal to 6, but to show this requires a little more work. 
We will postpone the proof for a while (see the corollary to Proposition 
13.18). For the moment we simply note that J * p is a Drinfeld module 
with, perhaps, a different structural map from that of p. 


It is instructive, and useful, to understand p; when J is a principal ideal. 
Let J = (b). Then, by definition, p; is the unique monic generater of the 
left ideal in L < 7 > generated by py. Let c € L be the leading coefficient 
of pp. Then, clearly, py») = c~* po. 


Proposition 13.14. If p € Drinz(A) and0 4b € A, then po) = c7 pr, 
where c is the leading coefficient of py. Moreover, c|(b) * pla = pac for all 
a € A; 1.e., (b) * p ts isomorphic to p over L. 


Proof. We have already proven the first assertion. To prove the second, 
note that for all a € A we have pi»)pa = [(b) * plays). Thus, c7!pypa = 
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[(b) * plac” pp. Since pyPa = Paps We can cancel p, (because L < 7 > has 
no zero divisors) and conclude that c~1p, = [(b) * plac’. 


The *-operation has a number of important properties. The following 
proposition provides two of the most useful of them. The proof is fairly 
straightforward, but somewhat tedious. We will leave the proofs as an ex- 
ercise. A good reference is Goss [4], Lemma 4.9.2. 


Proposition 13.15. Let p € Drinz(A) and I,J Cc A be non-zero ideals. 
Then 
pry =(I* p)ypr and Ix*x(J*p)=IJ*p. 


The first of these relations will be especially useful in our next task, which 
is to generalize the relations deg, p, = rdega and w(a) = hdegQ ordg(a) 
to the isogenies p;. To do this we will need a new definition. 

Let w, : LD < tT >-» Z be the map which assigns to a non-zero element 
f of L < rT > the smallest index of the non-vanishing coefficients of f. In 
other words, if f = an c;7', the w-(f) = io if c; = 0 for i < ip and 
Cio ft QO. 

It is clear that w,(fg) =w,(f)+w-(g) and that w,(f) = 0 if and only if 
f(x) is a separable polynomial in x. If p is a Drinfeld module then, in our 
previous notation, w(a) = w,(p,). We introduce this new mapping because, 
among other things, when dealing with more than one Drinfeld module, the 
notation “w(a)” is ambiguous. 


Lemma 13.16. Let p € Drin,(A) and let I C A be an ideal prime to the 
A-characteristic Q of L. Then pz(x) is a separable polynomial. 


Proof. Let a € I with a not in the A-characteristic of L. We have (a) = IJ 
for some ideal J. By Proposition 13.15, pia) = (I*p) spr. Since a is not in Q, 
we know w,(pa) = 0 (see Proposition 13.9). Thus, w;(pja)) = w7(Pa) = 9. 
The result now follows, because 

0 = wr(p(ay) = wWe( (I * p)z) +er(p2) 
which implies that w,(p7) = 0. 


Proposition 13.17. Let p € Drinz(A) be a Drinfeld module of rank r. Let 
ICA be a non-zero ideal. Then, deg, py = r deg I. 


Proof. To begin with, let us assume that J is prime to the A-characteristic 
of L. Let M be an algebraically closed field containing DL and consider 
M,|Z]. By Lemma 13.16 and Proposition 13.12, we see that 


#:M,|I] _— ghee pr 
On the other hand, by the Corollary to Theorem 13.1, we see that 


#M,|I] = gr ee! 
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Combining these two facts gives the result in the present case. 

Assume now that the A-characteristic, Q, is not zero, and that J is a non- 
zero ideal divisible by Q. By a standard result in the theory of Dedekind 
domains, there exist elements a,b € A and an ideal J, prime to Q, such 
that aJ = bI. Applying the first part of Proposition 13.15, we find 


(J * p)ayps = 1 * p)wypr - (1) 


Since isogenous Drinfeld modules have the same rank and using the fact 
that pq) differs from pz by multiplication by a non-zero element of L, we 
deduce from Equation 1 that 


rdega+deg, py; =rdegb+rdegl . 


We have used Proposition 13.8 and the first part of the proof applied to J. 
From the relation aJ = b/ we find that dega + deg J = degb+ deg /. 
Putting all this together, we see that deg, p; = rdeg J, as asserted. 


Proposition 13.18. Let p € Drin,(A). Assume that the A-characteristic 
of L, Q, is not zero. Let J Cc A be a non-zero ideal. Then w,(ps) = 
hdeg @ ordgJ, where h is the height of p. 


Proof. If J is prime to Q, the w;(p7) = 0 by Proposition 13.16. We also 
have ordgJ = 0, so the proposition is proven in this case. 

Now, assume @ divides J. Then as above we can write aJ = bI, where 
I is an ideal prime to Q. Applying w, to both sides of Equation 1, we find 


hdeg Q ordg(a) + w,(p7) = hdegQ ordg(b) +0. 


We have used the fact that isogenous Drinfeld modules have the same 
height, Proposition 13.9, and the first part of the proof applied to the ideal 
I, 

Since aJ = bI, we have ordg(a) + ordgJ = ordg(b) + 0. Thus, 


W7(py) = hdegQ ordg(b/a) = hdeg@ ordgJ , 


and the proof is complete. 


Corollary. Let p be a Drinfeld module over a field L with structural map 
6:A->L. Let IC A be an ideal, and p’ = I * p with structural map 
6°: A> L. Then 6 = 6’. 


Proof. If p has A-characteristic zero, then by Lemma 13.16, p;(z) is a 
separable polynomial for all ideals J. This implies that the constant term 
of pr € L <7 >, c(J), is not zero. Setting I * p = p’, consider the equation 
PIPa = P,pr- Comparing the constant terms on both sides yields c(J)é(a) = 
6’(a)c(I). Since c(1) 4 0, we conclude 6(a) = 6’(a) for alla € A. 

Now assume that the A-characteristic of p, Q, is not zero. By the propo- 
sition, the first non-vanishing term of p; is of the form cr™ where c 4 0 and 
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m is an integer divisible by deg Q. Again consider the equation prpa = p),pI 
and compare the coefficients of 7” on both sides. We find 


cé(a)? =6'(a)e andso d(a)? =6'(a). 


Now, 6 induces an injection of A/Q into L and so 6(a) is an element of 
a finite field with q¢°@@ elements. Since deg Q | m it follows that 5(a)?” = 
6(a). Thus, 6(a) = 6’(a) for all a € A and we are done. 


Having introduced the general notion of Drinfeld module, division points, 
rank and height, morphisms (isogenies), and some of their properties we 
now break off the general development to ask the possibly embarrassing 
question about whether Drinfeld modules exist. When A is a polynomial 
ring, there is no problem. As we observed in the last chapter, when A = 
F|T|, we simply assign to T any element of L < 7 > with constant term 
6(T) and this automatically extends to a homomorphism p: A> L <T > 
with the property that D(p.) = 6(a), i.e., a Drinfeld A-module over L. 
When A is more general, it is not clear that there are any elements of 
Drin,(L). Indeed, why should the non-commutative ring L < 7 > have 
a commutative subring isomorphic to A? To construct Drinfeld modules 
in the more general situation we follow Drinfeld and introduce analytic 
methods. The construction will be similar to the construction of elliptic 
curves over the complex numbers C by means of two dimensional Z-lattices 
and the associated Weierstrass P-functions. 


Recall that k.. is the completion of k at the prime oo. Let ko be the 
algebraic closure of k,,. The (normalized) valuation on ko 


|Ploo _ g7 Ordos (0) doo 


extends to kg uniquely by means of the formula 


IYloo = |NE/e.0(Y) fle Roe) 
where & is any intermediate field containing y and of finite degree over ko. 
We now define C to be the completion of k., with respect to | * |oo. 
This valuation extends uniquely to C and C is complete. It is also well 
known that C is algebraically closed. The field C plays the role of the 
complex numbers in our context. The theory of infinite series and infinite 
products can be developed for functions defined on open subsets of C and 
the usual theorems continue to hold in even stronger form. In particular 
>2@n converges if the terms a, tend to zero and [](1 + a,) converges if 
>», Gn converges. A function from C — C is said to be entire if it can be 
represented by a power series 4’ a,z" which converges everywhere. The 
set of zeros of an entire function form a discrete subset of C. Moreover, an 
entire function is determined by its zeros in a much stricter way than in the 
theory over the complex variables. For example, the exponential function 
in the complex theory is a highly non-trivial function, but it has no zeros 
at all. Over C we have the following, very different, type of result. 
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Proposition 13.19. Let f(x) be a non-constant entire function on C. 
Then f(x) has at least one zero. 


Corollary. Let f(x) be a non-constant entire function on C. Then f(x) is 
onto as a map from C > C. 


Proof. Let c € C and consider then entire function —c + f(x). By the 
proposition this function has a zero, say, y. Thus, f(y) =. 


The proof of the proposition uses the Newton polygon in the context of 
power series. For a treatment of this and other results which we employ 
about analysis on C, see Goss [4], Chapter 2. 

If y is a zero of an entire function, there is a uniquely determined positive 
integer m such that f(x) = (x—y)™ g(x), where g(x) is entire and g(y) 4 0. 
The integer m is called the multiplicity of the zero y and is denoted by 
ord;=7f (x). 


Theorem 13.19. Let f(x) be an entire function on C and let {y; | i = 
1,2,3,---} be its zero set with 0 excluded if f(0) = 0. Let m; be the mul- 
tiplicity of y;,. Then, limi. 7, = oo and there is a constant c 4 0 such 


that - a 
f(2) = ea" [T] (1- =) . 
i=l a 


The integer n is equal to ordz=o f(x). Conversely, if limj5oo ¥4 = 00, then 
the above infinite product defines an entire function on C. 


Definition. A lattice is a discrete, finitely generated, A-submodule of C. 
If [ c C is a lattice, the dimension of the vector space k,l’ over ko is 
called the rank of the lattice. 

One can show that lattices are formed in the following manner. Let 
{W1,W2,°+- ,w,} C C be a set of elements linearly independent over kgo. 
Let {J1,l2,--- ,1-} be a set of fractional ideals of A. Then, 


P= Iywy + Jowg +--+ +T,w, , 


is a lattice in C of rank r. In fact every lattice of rank r has this form. This 
shows that lattices exist in abundance and in every rank. 

Let I‘; and I’, be two lattices, and let c € C be such that cl’; C To. 
Then ¢: 1; — [2 given by ¢(x) = cz, is an A-module mapping. We define 


Hom(P,,T2) = {ee C |] cl, CT 2}. 


One can show that since a lattice I’ is discrete, we must have ||. > 00 
as Y varies over the elements of I’. Thus, if we define 


er(x)=2]] € - = | 
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the result is an entire function on C (the product is over all non-zero 
elements of I’). We call ep(x) the exponential function associated to I’. It is 
characterized as being the unique entire function with simple zeros on the 
elements of I and with leading term x. It also has the remarkable property 
of being additive, as we now show. 


Proposition 13.20. Let I be a lattice in C. Then for all u,v € C and 
aéF we have 


er(u+v) =er(v) +er(v) and ep(au) = aer(u) . 


Proof. For each positive integer M, define [yy = {y ET | |y|lo0 < M}. This 
is readily checked to be a finite F vector space (as we’ve seen, |7Y|o. > 00 
for y € [). If we set 


then er(x) = limy+40 Py(xr). The result then follows from the following 
lemma. 


Lemma 13.21. Let V CC be a finite, F vector space, and set 
fy(x) = [[@-v). 
veV 
Then, Py(x) ts an F-linear, additive polynomial in x. 


Proof. We prove this by induction on the dimension of V. If dimV = 0, 
then V = (0) and fy (x) = 2, so the result is true in this case. 

Now assume that the result is true for vector spaces of dimension less 
than n and that dimV = n. Write V = W+Fy, where W isan-—1 
dimensional subspace of V. From the definition it is easy to see 


fv (x) = fw(z) I] fw(x—- ap) . 


040€F 

By induction, fw(@ — ap) = fw(x) —afw(y). It follows that 
fv (x) = fw(2)? — fw (u)?* fw (2) - 

It follows immediately that fy(ax) has the required properties. 


We see from these considerations that er(x) can be written as an infinite 
series as follows: 


ep(z) = x+ So ei(T)x? with «(T)EC. 
i=1 
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The main reason for introducing these exponential functions is that, as we 
shall now show, to every lattice we can construct an element of Drin,4(C). 
Moreover, this assignment is an equivalence of categories if we use our def- 
initions of Hom(I,,I2) for lattices and Hom(p;, p2) for Drinfeld modules. 
In particular, this will show that Drin,(C) has lots of elements. 


Let [ CI” be two lattices of the same rank. Then, [’/T is a finite A- 
module which maps isomorphically into a finite F vector subspace of C by 
means of the exponential function er(x) (note that the exponential function 
is F-linear by Proposition 13.20). Define 


P(2;I'/T)=2 |] (-=5] | 


wel’ /T 


Proposition 13.22. The polynomial P(x;I’/T) is F-linear of degree 
#-(["/T’). Its initial term is x. Moreover, 


er (u) = P(er(u);T’/T) . 


Proof. The first assertion follows from Lemma 13.21, and the second as- 
sertion is clear from the definition. 

To prove the identity, notice that P(ep(u);I’/T) is zero if and only if 
er(u) = er() for some p € I”, ie., if and only if er(u — uw) = 0. This is 
true if and only if w— uw € IT, which in turn is true if and only if u € I’. 
Thus, the right-hand side of the proposed identity is an entire function with 
simple zeros (the simplicity of the zeros is easily checked) at the elements 
of I” and the initial term is u. These conditions characterize ey (u). 


Theorem 13.23. Let Il CC be a lattice of rank r. For eacha € A, a 40, 
define pl € C <7 > by the formula 


pi (x) =aP(2,a1T/T) . 


Then, if we send zero to zero and mapa — pl for04a€ A, the result is 
a Drinfeld A module over C of rank r. 


Proof. In what follows we regard A is a subset of C via the inclusions 
A-+k—k.— C. Thus the structure map 6: A > C is just the inclusion, 
so the first thing we must show is that D(p!) = a. This, however, is clear 
from the definition. 

Next, we have to show that pl, = pl p} and pl 4p = pi, + pt. We prove 
the first and leave the second as an exercise. 

The idea is to work on the level of F-linear polynomials and use the 
exponential functions. By Proposition 13.22, 


€,-ip(u) = P(ep(u);a* T/T) . 


By looking at the zero set and the initial term, it is easy to see eg-1p(u) = 


a~ter(au). Substituting this in the above equation, and using the definition 
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of p!, yields the following fundamental identity: 


er (au) = pa (er(u)) . (2) 


One now computes 


pa(er(u)) = er(abu) = pi, (er(bu)) = 04 (94 (er(u))) - 


Since ep(u) maps C onto C, we can conclude pl, = pl p;,. The proof of the 
additive identity is similar and even easier. 

It remains to show that the rank of p' is r. To do this we must show that 
deg, pt = rdega. The degree of p! (x) as a polynomial in x is #(a7!T/T). 
Recall that as an A module I is isomorphic to the direct sum of r fractional 
ideals. Since a~!I/I & a~!A/A & A/aA, for any non-zero fractional ideal 
I, it follows that #(a7'T'/T) = q”?®*. Thus, deg, pl = r dega. The proof 
is complete. 


Theorem 13.24. Let Lat4(C) be the set of A-lattices inside C. The map 
T — p! from Lat 4(C) > Drin,(C) is one to one and onto. 


Proof. We will prove the map is one to one and only give a brief sketch of 
the proof that it is onto. 

Suppose I and I” are two lattices such that p™ = p! . It is convenient to 
work inside the ring of twisted power series C << Tt >>. This consists in 
formal power series )-°°, ciT", with the usual addition and multiplication 
except for the non-commutativity relation Tc = c!r. Clearly, C <7 > isa 
subring of C << 7 >>. Every additive power series, such as er(u), can be 
considered as an element of C << T >> applied to u where T(u) = u?. The 
fundamental relation (Equation 2) given in the proof of Theorem 13.23 can 
be rewritten as 

era = pier . 


Since pl = p!, we also have 
era = prey . 
Subtracting, we find that 
(er — era = py (er — er) . 


We want to deduce from this that ep = er. Suppose that ep — er’ is 
not zero. Since both exponential series have initial term 7°, the first non- 
vanishing term of their difference has the form cr* where 0 4 c € C and 
k > 0. Then, comparing the coefficients of r* on both sides of the above 
identity yields 

cal = ac. 


This shows a? = a for all a € A. This is false if a is not a constant. We 
have arrived at a contradiction which implies that ep = er. Since I’ is the 
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zero set of ep(x) and I’ is the zero set of ep (x), it follows that [ = I” as 
asserted. 

To prove the onto-ness of our map, let g € Drin,(C), and choose an 
a€ Awitha ¢ F. Using the method of undetermined coefficients, one finds 
a power series f € C << T >> with initial term 7°, such that fa = pof. 
One then proves this relation must hold for all a € A and that f(z) is an 
additive power series, which is, in fact, an entire function on C. The zero 
set of this function turns out to be an A-lattice [. One then proves that 
p' = p. For the details of this argument see Goss [4], Theorem 4.6.9. 


It is of interest to pause at this point and ask what is the lattice corre- 
sponding to the Carlitz module, the first Drinfeld module to appear in the 
literature. Since the Carlitz module has rank 1, the corresponding lattice 
must be of rank 1 over A = F/T]. Thus, it must be of the form A7 for some 
mw € C. Carlitz found an explicit expression for 7 as an infinite product. Set 
i] = 77 —T and F; = [i][i —1]9... [1]? (see the exercises to Chapter 1 
where some of the properties of these polynomials are set forth). Define i 
to be any q — 1-st root of T — T?. Then, we have 


7= iT] (1- a) | 


t=0 


Carlitz also computed the exponential function corresponding to the lattice 
Av. It is given by 


Actually, Carlitz did not first define the Carlitz module and then work 
out this exponential function. He was first led to construct this exponential 
function and then proved the “complex multiplication” property e4z(Tu) = 
Teag(u) + eaz(u)?. It was this remarkable property of the exponential 
function which led to the invention of what we now call the Carlitz module; 
see Carlitz [2, 3], and, also, Goss [4], Chapter 3 (in Carlitz’s papers the 
notation is somewhat different and the module he works with is defined by 
u— Tu-—vu! rather than u > Tu + u®). 


Having made a short detour to discuss the special case of the Carlitz 
module, we now return to the general theory. We have set up a one-to-one 
correspondence between Lat 4(C) and Drin,4(C) (which is rank preserving). 
We now want to deepen this relationship by showing a correspondence 
between elements of Hom(I’,I’) and Hom(p', p™ ). 


Theorem 13.25. Let T,T’ € Lat,a(C) be lattices of the same rank and 
suppose 0 #c € Hom(I,I”). Define 


f(x) = cP(a;c7'T'/T) . 
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Then, fe € Hom(p*, p' ). Moreover, c > f. is an isomorphism of Hom(I, I’) 
with Hom(p', p' ) as abelian groups (and even as F vector spaces). 

Proof. By Proposition 13.22, we know that e,-1p/(u) = P(er(u);c7'T’/T). 
As we have seen previously, e,-1p/(u) = c ter (cu). Thus, ep (cu) = 
f-(er(u)). It is convenient to work inside C << 7 >> where this rela- 
tion becomes ep-c = f,er. Multiply both sides on the right by any a € A 
and calculate. We find 


/ / 
erca =epac= pl epc=p" freer , 


and 
f-era = fepi er 


Setting both expressions equal to one another and cancelling er on the 
right yields the identity po. fe = fcp',. Since this is true for all a € A we 
have shown that f, € Hom(p!, p'). 

It is easy to check that c > f, is F linear. Since D(f.) =, it is also clear 
that this homomorphism is one to one. It remains to show that it is onto. 

We will sketch the proof of the ontoness. Suppose f € Hom/(p!, p' ‘), 
If f = 0 we may choose c = 0, so suppose f # 0. For each a € A we 
have fol = p. f. Multiply both sides of this identity on the right with 
er. We find (fer)a = p! (fer). Let c = D(f). One has to show that 
c #0. This is not too hard using the fact that f intertwines p' and o 
and that C has A-characteristic zero. Then, erc has the property that 
erica = eprac = p! eve. Thus, 


(fer —epc)a = pl (fep — ere). 


By our choice of c, the coefficient of 7° in fer — eprc is zero. By the same 
argument used at the end of the proof of Theorem 13.24, we can conclude 
that fer = erc. As power series, this says that f(er(u)) = er (cu). If 
7 € T we se that y is a root of the left hand side which implies 0 = ep (cy) 
and so cy € I’. We conclude that cf’ CI”, ie., ce € Hom(I,T’). The proof 
is concluded by showing that f = f,. The argument uses the fact that 
D(f) = D(fc) and that for alla € A, (f — fe)ph = pl (f — fe). 


The last two theorems make it possible to answer questions about the 
category of Drinfeld modules over C by considering the same question in 
the category of lattices which is much easier to analyze. As an example, we 
prove the following theorem about Drin%(C,1), the set of rank 1 Drinfeld 
A-modules over C up to isomorphism. To be more precise, Drin%(C, 1) 
is the quotient of the set Drin(C,1) of rank 1 Drinfeld A-modules over 
C modulo the equivalence p ~ p’ if and only if p and p’ are isomorphic 
over C. 


Theorem 13.26. The set Drin{(C,1) is finite with cardinality equal to 
the order of the class group of A. 
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Proof. By Theorems 13.24 and 13.25, it is equivalent to consider the set 
of rank 1 A-lattices in C up to isomorphism. Note that two lattice I and 
I’ are isomorphic if and only if there is a c € C* such that T = cl”. 

Every rank 1 lattice has the form Jw, where I is a fractional ideal of A 
and w € C*. For every fractional ideal I of A, let I be the set of lattices 
equivalent to J. Clearly if two ideals are in the same ideal class, they go to 
the same class of lattices. By the first remark, this map from ideal classes 
to lattice classes is onto. Suppose J, and Iz are two ideals such that [y = Io. 
By definition, there is an w € C* such that J, = Igw. From this equation 
we can deduce that w € k and this shows that J; and J are in the same 
ideal class. Altogether then, we have produced a one-to-one, onto map from 
the class group of A to the isomorphism classes of rank one A-lattices. This 
proves the theorem. 


We conclude this chapter with a few remarks on how to make Theorem 
13.26 into a more structural theorem. The operation star operation (J, p) > 
I * p gives an operation of the group of fractional ideals of A on Drin,(C). 
Since for a principal ideal (a) we have (a)*p is isomorphic to p, this descends 
to an action of Cl(A), the class group of A, on Drin4(C). If we restrict this 
action to rank 1 Drinfeld modules, we claim this action is one to one and 
transitive, i.e., Drin4(C,1) is a principal homogeneous space for Cl(A). 
This is the more structural form of Theorem 13.26. 

To prove this result we pass to the equivalent category of lattices, Lat 4(C). 
We have an obvious action (J,T) > IT of fractional ideals on lattices. By 
the way isomorphism between lattices is defined, it is clear that this action 
descends to an action of Cl(A) on Lat%(C), the isomorphism classes of 
A-lattices in C. If we restrict attention to rank 1 lattices we easily see that 
Lat4(C, 1) is a principal homogeneous space for Cl(A). 

There is one subtlety, however, which must be addressed before applying 
this calculation with lattices to Drinfeld modules. Namely, if I is the lattice 
associated with p, what is the lattice associated with I * p ? The answer 
is not JT’, which is a good first guess. Let c(J) = D(p;). Then the lattice 
associated with I * p is c(I)I~'T. The proof of this is not too hard to give 
using the techniques introduced in this chapter. Although the right answer 
is a little more complicated than expected, it nevertheless leads to the final 
result. 


Theorem 13.27. The set Drin4(C,1) ts a principal homogeneous space 
for Cl(A) under the action induced by (I, p) > I*p, where I is a fractional 
ideal of A and p € Drin,(C). 


We could go on to consider isomorphism classes of rank 2 Drinfeld mod- 
ules, a question which leads to the theory of Drinfeld modular curves. 
Instead, we will stop here and go to other topics. A good introduction to 
Drinfeld modular curves and their properties is found in Gekeler [2]. 
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Exercises 


1. 


10. 


11. 


Fill in the details of the proof of Theorem 13.1 in the case M[P*| 
where e > 2 and P #~ Q. You may use the following fact. If M 
is a finitely generated module over a Dedekind domain A which is 
annihilated by a power of a maximal ideal, say, P®, then M is a 
direct sum of cyclic modules of the form A/P/ where f < e. 


. Similarly, fill in the details of the proof of Theorem 13.1 in the case 


M,|Q°] where e > 2. 


. Let p,p’ € Drin,(L). If p and p’ are isogenous show that they have 


the same height. 


. Prove Lemma 13.11. 
. Prove Proposition 13.15. 
. In Proposition 13.22 we showed P(er(u),I’/P) has a zero at each 


element of I’. Show that each such zero is simple. 


. Prove that pl, =p, +p) (part of Theorem 13.23). 


. Show that in a neighborhood of zero in C we have 


u 
=l]- Gr(L)u” , 
er(u) dX (I) 
(q—1)|n 


where G,, (I) = Soyer y~", where the prime indicates that 0 is to 
be omitted. Show that the sums G,,(I) converge and discuss the 
convergence of the expression given above for u/er(w). 


. Show that the lattice associated to the Drinfeld module J * p! is 


D(I)I~'T where D(I) is the constant term of p}(r). 


In the text we discussed the lattice corresponding to the Carlitz mod- 
ule and the corresponding exponential function. The lattice is Aw 
and we gave an Carlitz’s explicit formula for 7. For convenience, set 
€an(u) = ec(u). If m € A is a monic polynomial show that the set 
of m-division points in C for the Carlitz module is given by 


A = {eg(ait/m) | a€ A, dega < degm}U {0} . 


(Continuation) Set \m = ec(it/m). Let a € A be a polynomial of 
degree less than deg m. Show that 


~ 1 
ordgoCa(Am) = degm — dega — 1 — 7° 
qd — 
This important formula was proved in Chapter 12 using the Newton 
polygon. The analytic proof, sketched here, is due to D. Goss. 


14 


S-Units, S-Class Group, and the 
Corresponding L-Functions 


Let K/F be an algebraic function field over the field of constants FP’. 
Throughout this book we have been emphasizing the analogy between the 
arithmetic of K and that of an algebraic number field. This analogy is par- 
ticulary clear when we choose an element x € K which is not a constant. 
The ring A = Fz] C k = F(z) then plays the role of the pair Z C Q in 
number theory. K is an algebraic extension of F(x) and the analogue of 
the ring of integers in an algebraic number field is the integral closure of 
A in K. Let’s call this ring B. We will show that B is a Dedekind domain. 
We will investigate the unit group and the class group of B. We will also 
associate zeta and L-functions to B. 

The ring B and its properties can be discussed in a slightly different, 
somewhat more intrinsic, way. Let oo denote the prime at infinity in the 
subfield k = F(x) and denote by S the finitely many primes in K lying 
above oo. We will show that B is the intersection of all the valuation rings 
Op for P € Sx — S (recall that Sx is the set of all primes of K’). This 
being the case, let S C Sx be any finite set of primes. Define 


Os = {a€ K | ordp(a) >0, VP € S}, 


the ring of S-integers. We will define S-units, S-divisors, S-class group, 
and even, an S-zeta function. After discussing these concepts and deriving 
their basic properties, we will show how all of this relates to the arithmetic 
properties of the field K. 

Finally, we will discuss Z-functions in a slightly more general situation. 
Namely, suppose that K/k is an abelian extension of global function fields 
and that S is a set of primes of k (not of K as in the above paragaph). We do 
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not assume that k is a rational function field. Let A be the ring of S-integers 
of k and B the integral closure of A in K. We will introduce L-functions, 
L(w, x) determined by the ring A and a character x of the Galois group. 
Using properties of these functions, we will derive a very general class 
number formula relating the class number of B to the class number of A 
and certain finite character sums. In Chapter 16 we will look more closely 
into these formulas in the special cases of quadratic and cyclotomic function 
fields and find close analogues to a number of classical results in algebraic 
number theory. 


Let us return to considering S as a finite set of primes of K. In addition 
to the definition of the ring of S-integers we will need a number of other 
definitions. The S-unit group is defined by 


E(S) = {a € K* | ordp(a) = 0, VP ¢ S}. 


It is clear that E(S) = O%, the units of the ring of S-integers. Moreover, 
F* C E(S). We will see that E(S)/F™* is a finitely generated, free abelian 
group. 

Since the field K will be fixed throughout the first part of our discus- 
sion, we denote by D its group of divisors, by P the subgroup of principal 
divisors, and by Cl = D/P the group of divisor classes. The group of S- 
divisors, Dg, is defined to be the subgroup of D generated by the primes 
in Sx — S. Given an element a € K*, we define its S-divisor to be 


(a)s = 3 ordp(a)P. 


P¢S 


A divisor which is of the form (a)gs for some a € K™ is called a principal S- 
divisor. The principal S-divisors form a subgroup of Ds, which is denoted 
by Ps. The quotient group Cls = Ds/Ps is called the S-class group. Later 
we will show that Clg is isomorphic to the ideal class group of the Dedekind 
domain Og. 

Finally, we define D(S) to be the subgroup of D generated by the primes 
in S and P(S) = PND(S). 

Consider the degree map deg : D > Z. The image of this map is a 
principal ideal iZ. The integer i is easily seen to be the greatest common 
divisor of all the elements of the set {deg P | P € Sx}. When F is a finite 
field a theorem of F.K. Schmidt insures that 1 = 1. However, in the general 
case it is quite possible for 7 to be greater than 1. For example, consider 
the quotient field of the integral domain R[X,Y]/(X* + Y?+41). This is a 
function field over the real numbers R as constant field. It is not hard to 
check that every prime has degree 2 and so we must have 7 = 2 for this 
example. 

The image of D(S) under the degree map is also a principal ideal in 
Z which we denote by dZ. The integer d is characterized as the greatest 
common divisor of the elements in {deg P | P € S}. Clearly, 2 divides d. 
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Proposition 14.1. The following sequences are exact: 


(a) (0) > F* + E(S) > P(S) > (0) , 


(b) (0) + D(S)°/P(S) + Cl? 4 Cly > C > (0) , 


where C' is a cyclic group of order d/i. 


Proof. The map from E(S) to P(S) is given by taking an S-unit to its 
divisor. This map is onto by the definition of P(S). If an S-unit e goes 
to the zero divisor, then ordp(e) = 0 for all P € Sx and so must be a 
constant. This proves the exactness of sequence (a). 

To deal with the second exact sequence we first define a map 7:D— Dg 
as follows: 

T(D) = S- ordp(D)P . 
P¢s 


This map is an epimorphism with kernel D(S). The image of P under 
T is Ps. Thus, 7 induces a homomorphism from Cl — Clg with kernel 
(D(S) + P)/P = D(S)/P(S). From this we deduce the exactness of the 
sequence 


(0) > D(S)°/P(S) > Cl? > Clsg , 


and it remains to show that the cokernel of the last arrow is a cyclic group 
of order d/1. 

To do this, we again use the fact that 7 induces an isomorphism from 
D/(P+D(S)) to Clg. The group we are interested in can also be described 
as the cokernel of the natural map from D°/P to D/(P + D(S)). This 
cokernel is easily seen to be isomorphic to D/(D° + D(S)) (use the fact 
that P C D°). The degree map provides an isomorphism of D/(D° + D(S)) 
with iZ/dZ = Z/(d/1)Z. This completes the proof. 

This proof is due, in essence, to F.K. Schmidt. See his classic paper 


(Schmidt [1]). 


Corollary 1. The group E(S)/F™* is a finitely generated free group of rank 
at most |S|—1, where |S| is the number of elements in S. 


Proof. By the exact sequence a) we have E(S)/F* = P(S), which is a 
subgroup of the free group D(S)° on |S| — 1 generators. Thus, P(S) is free 
on at most |.S|— 1 generators. 


Corollary 2. Clg is a finite group if Cl° is a finite group. Also, Clg is a 
torsion group if Cl° is a torsion group. 


Proof. Both statements are immediate consequences of exact sequence b). 


Proposition 14.2. Let K/F be a function field over a finite field F. Then, 
for all finite subsets S C Sx we have that Clg is a finite group and E(S)/F* 
is a free group on |S|—1 generators. 
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Proof. By Lemma 5.6, C/° is a finite group. By Corollary 2 to Proposition 
14.1 we see that Clg is a finite group. 

By exact sequence b) of Proposition 14.1 we see that D(S)°/P(S) is 
finite. This shows that P(S) is free on |S|— 1 generators. We have already 
seen that P(S) = E(S)/F"*. 


It is fairly clear that the above results are analogues of finiteness of class 
number and the Dirichlet unit theorem in algebraic number theory. 

For the rest of the chapter, we will assume that the constant field F = F 
is a finite field with q elements. 

Our next task is to introduce the S-zeta function and investigate some 
of its properties. Recall the definition of Cx (w), 


(x(w)= [] Q-NP-Y)™ . 


PESK 


If S is a finite set of primes, we define the S-zeta function to be 


(s(w)= J] (1- NPY). 


P¢s 


Two remarks are in order about the notation. Since we are not varying 
the field K in the discussion, we write Cs(w) rather than Cx ,5(w). Secondly, 
we will use w as the variable instead of s, which we have used earlier. Among 
other reasons, this is because the notation ¢s(s) is a bit confusing. Also, 
we want to reserve s to represent the number of elements in S, i.e., s = |S]. 

It follows immediately from the definition that 


¢s(w) = |] Q-NP~”) Ce(w) . (1) 


Pes 


Since Cx (w) is a rational function of q~”, the same is true for ¢s(w). We 
will be interested in the power series expansion of ¢g(w) about w = 0. By 
Theorem 5.9 we know that 


Le(q ™) 
(1—-q-“)(—gi™) - 
where Lx(u) € Zlu] is a polynomial with the property that Lx (1) = hx, 
the number of divisor classes of degree zero. It follows (as we have seen 
before) that 


CK(w) = 


hk 
In(q)(q—-1) | 
Since NP~¥ = g~deeP w = e—InqdeeP w we see that 1 - NP-¥ = 


Ingdeg P w+O(w7?). Using this information and substituting into Equation 
1, we find 


¢s(w) = —(q-1)7*hk«( || deg P)(Ing)**w** + O(w’). (2) 
Pes 


dim w¢x (w) = 
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From this we see that the order of vanishing of ¢s(w) at w = 0 is s — 1, 
which is equal to the rank of the S-unit group by Proposition 14.2. We will 
now show how to rewrite the leading coefficient given in Equation 2 so that 
it becomes strikingly close to what it looks like in the number field case. 

An ingredient in the calculation will be the S-regulator. To define this, 
we begin by choosing a set of S-units {e),¢€2,...,€s—1} whose projection 
to E(S)/F* is a basis. Consider the (s —1) x s matrix M whose 1j-th entry 
is Ine;|p,, where S = {8), Po,...,B,}. We claim that the sum of the 
columns of this matrix is zero. To see this, note that for any a € K* we 
have 


—~ J In|alp = S| ordp(a) deg P Ing = deg(a) Ing=0. 
P P 


For any S-unit, the only primes which occur in the sum are the primes in 
S. Our assertion follows. 

It follows that the determinants of the (s — 1) x (s —1) minors of M are 
all the same, up to sign. The absolute value of any of these determinants is 
then taken as the definition of the S-regulator. We denote the S-regulator 
by Rg. It is not hard to show that the S-regulator is independent of the 
choice of basis {e1,€2,...,€s—1}. 

An associated regulator RY has the same definition as Rg except that 
throughout one uses log,(z), the logarithm to the base q, instead of the 
natural logarithm, In(z). The two regulators are related by the equation 


(Ing)* 1 RW = Rg. 


It is worthwhile to give a more direct definition of RY , which has the 
advantage of showing that it is an ordinary integer. Simply notice that 


log, (jelp) = log, (N P~or4P(2)) = —deg P ordp(e) . 


Now, form the (s — 1) x s matrix whose ij-th entry is — deg P, ordp, (e;). 


Then RY is the absolute value of the determinant of any (s — 1) x (s — 1) 
minor of this matrix. 


Lemma 14.3. 
dR 


(IIpes deg P) — 


Proof. We begin by defining a map !: D(S) > Z*. If D € D(S), we set 
l(D) =(...,—ordpD deg P,...), where P varies over the set S. Note that / 
is a homomorphism and that ifa € E(S), then I((a)) = (..., log, |a|p,...). 
Also, it is easy to see from the definition that [Z* : (D(S))] = [[peg deg P. 

Consider the elements of Z* as row vectors and define H° Cc Z* as the 
subgroup consisting of row vectors the sum of whose coordinates is zero. 


[D(S)? : P(S)] = 
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We have 1(P(S)) C l(D(S)°) C H°. It is easy to check that / is one to one. 
It follows that 


O. _ oY _ [f° : U(P(S))| 
DUS)": PS) =1OS)) LPN =a: 3 

We now calculate the numerator and denominator of this expression. 

First we compute the index [H° : 1(P(S))]. Let e, € Z* be the vector 
with zeros everywhere except for a 1 at the s-th place. Then, Z® is the 
direct sum of H° and Ze,. It follows that the index of [(P(S)) in H° is the 
same as the index of I(P(S)) + Ze, in Z*. A free basis for this subgroup 
is {l((e1)),...,1((€s—1)),€5}. Let M“™ be the s —1 x s matrix whose i-th 
row is 1((e;)) and M’ be the s x s matrix obtained from M“?) by adjoining 
€, as the bottom row. By a simple application of the elementary divisors 
theorem (see Lang [4], Theorem 7.8), the index we are looking for is the 
absolute value of the determinant of M’. Expanding this determinant in 
cofactors along the bottom row shows the index in question is RY, 

To compute [H° : 1(D(S)°)], consider the exact sequence 


(0) > H°/l(D(S)°) > Z° /l(D(S)) > Z/dZ = (0) . 


The second arrow is induced by inclusion and the third arrow by the sum 
of coordinates map from Z* — Z. From this exact sequence, we deduce 


_ Ilpeg deg P 


[H? : (D(S)°) ; 


Substituting these results into Equation 3 completes the proof of the 
lemma. 


Corollary 1. Suppose all the primes in S have degree 1. Then, [D(S)° : 
P(S)] = RY. 


(q) 
S 


Corollary 2. Both regulators Rs and Re’ are not zero. 


Theorem 14.4. Let K/F be a function field over a finite field F with q 
elements. Let S C Sx be a finite set of primes with s elements. Then 


hg R 
Cs(w) = we) + Ow"). 


Proof. Referring to Equation 2 we see that everything has already been 
proved except that in that equation the coefficient of w°—! is given as 


—(q-1)"'hx«( |] deg P)(Ing)*? . (4) 
Pes 


Our task is to show that this number is the same as that given in the 
theorem. 
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By Proposition 14.1, part b), we see that hed = hs[D(S)° : P(S)]. 
From Lemma 14.3, we deduce hx ([[peg deg P) = hgRY. Since RY = 
(Ing)-“- Rg, we find that hk ([Ipeg deg P) (Ing)*~! = hg Rg. Substi- 
tuting this into Equation 4 we obtain 


_AsRs 
q-1' 


which proves the theorem. 


The formula in this theorem is remarkably similar to the analogous for- 
mula for the S-zeta function in number fields. For a parallel “number field 
- function field” treatment see Tate [2]. 

The ring of S-integers can be characterized in other ways, as has already 
been suggested. 


Theorem 14.5. Let K/F be a function field with constant field F and let $ 
be a non-empty, finite set of primes. There exist elements x € K such that 
the poles of x consist precisely of the elements of S. For any such element 
x, the integral closure of F|x] in K is Og. Og is a Dedekind domain and 
there is a one-to-one correspondence between the non-zero prime ideals of 
Os and the primes of K not in S. The S-units E(S) are equal to the units 
of Os and the class group of Og, Cl(Og), is isomorphic to Clg. 


Proof. To begin with, let’s label the primes in S, S = {P,, P2,..., Ps}. For 
a large positive integer M consider the vector spaces L(MP;) = 
{x € K*|(x) + MP, > 0}. As soon as M is big enough (say, M > 2g — 2) 
we know from Corollary 4 to Theorem 5.4 that the dimension of this 
space is M deg P; — g + 1. It follows that L(MP;) is properly contained 
in L((M +1)P,). Pick an element x; which is in the latter set, but not in 
the former set. Then x; has a pole of order M +1 at P; and no other poles. 
Now consider x = 1129---x,. Then, x has each element of S as a pole and 
no other poles. 

With z chosen to have poles at the elements of S, and nowhere else, let 
R be the integral closure of F[z] in K. The ring R is a Dedekind domain. 
If K/F (a) is a separable extension, this fact is well known and is proven 
in many places. As is shown in Chapter V, Theorem 19, of Samuel and 
Zariski [1], it remains true even if K/F'(x) is inseparable. If P is a prime 
of K not in S, then x € Op and it follows that R C Op. Thus, 


Rc () Op =Os. 
P€S 


We will show that R = Og. Let P ¢ S be a prime of K and consider 
POR. It cannot be that POR = (0) since otherwise the quotient field of R, 
namely, A, would inject into the residue class field Op /P. However, Op/P 
is finite over F’. Thus, PM R = p is a maximal ideal of R, and Ry C Op. 
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This must be an equality because R, is a discrete valuation ring and so is a 
maximal subring of kK. On the other hand, if p is a maximal ideal of R then 
Ry, is a discrete valuation ring and (pR,, Ry) is a prime of K containing 2. 
This shows that p > (pR,, Ry) is a one-to-one correspondence between the 
maximal ideals of R and the primes of K not in S. Again using the fact 
that R is a Dedekind domain, we find (see Jacobson [2], Section 10.4) 


R= ()\R,=() Op=Os. 
pCR P¢s 


We have shown that Os is a Dedekind domain and that there is a one- 
to-one correspondence between the maximal ideals of Og and the primes of 
K not in S. The remaining statements of the theorem are straightforward 
and we leave them as exercises for the reader. 


It is of interest to see how these general ideas work out in particular 
cases. We will see how they apply in quadratic extensions of F(T’) and in 
the cyclotomic functions fields which were defined and discussed in Chapter 
12. We use the notation given there. 

Let’s assume that q = |F| is odd. Let f(T) € F[T] = A, be a square-free 
polynomial. Define K = k(,/f(T)) (recall that k = F(T)). One sees imme- 
diately that K/k is a Galois extension of degree 2 and that the non-trivial 
element, a, of the Galois group is characterized by o./f(T) = —\/f(T). 
A short calculation, completely analogous to what happens in quadratic 
number fields, shows that the integral closure of A in K, R, is equal to 
A+A,/f(T). 

Recall that the prime at infinity, 00, of k is defined by ord,h = —degh. 
Let U = 1/T. Then, ord,.U = 1, ie., U is a uniformizing parameter at 
infinity. Let d = deg f(T) and rewrite f(T’) in terms of U as follows: 


d d d 
F(T) = Soa T =T*) aT 4 =U4S aU =U“ f(U) . 
T=0 i=0 i=0 
Note that f*(U) € F[U] and that its constant term is ag 4 0, the leading 
term of f(T’). 


Proposition 14.6. Let kK = k(,/f(T) ), where f(T) € A = FIT] ts 
square-free. Let d = deg f(T) and ag the leading coefficient of f(T). If d is 
odd, then oo is ramified in K. If d is even, and ag is a square in F*, then 
oo splits in K. Finally, if d is even and ag is not a square in F*, then oo 
remains prime in K. 


Proof. Suppose d is odd. Since U is a uniformizing parameter at oo and 
aq # 0 is the constant term of f*(U), we see that f*(U) is a unit at infinity. 
Suppose P,, is a prime of K lying above oo. Then, setting e equal to the 
ramification index of P,, over ov, 


1 


ordp,.. /f(T) = sordp,, f(T) = 5 0rdooU~4 f*(U) =—<. 
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Since this number must be an integer and since d is assumed odd, it follows 
that 2\e. Thus, e = 2 and K/k is ramified at oo. 

Now suppose that d is even. Then, K is generated over k by \/f*(U). 
Since f*(U) is square-free as a polynomial in U, it follows that the integral 
closure, R’, of A’ = F[U] in K is A’ + A’,/f*(U). By Proposition 15 in 
Chapter 1 of Lang [5], the prime decomposition of oo follows from that of 
the irreducible polynomial X? — f*(U) reduced modulo U. The reduction 
is simply X? — ag € F[X]. This either splits or is irreducible according to 
whether ag is a square or not in F*. This completes the proof. 


We have given a rather old-fashioned proof. A more modern proof can 
be given using the properties of the completion F((1/T)) of k at oo. 


Following Emil Artin [1], we say that the quadratic function field K = 
k(./ f(T) ) is real if co splits in K and is imaginary in the other two cases. 
This closely follows the terminology in the number field case. 


Let B= A+ A,/f(T) be the integral closure of A in K. We want to 


compare the class number of B , hz, with hx. 


Proposition 14.7. With the above notation, we have hg = hx if oo is 
ramified, hp = 2hx if oo is inert, and hg log, |elp,, = hx if oo splits. In 
the latter case, e represents a fundamental unit in B, and Ps, is the prime 
above co at which e has negative ord. 


Proof. In the first two cases the set of primes above oo consists of one 
element P,.. Thus, s = 1 and the rank of D(S)° is zero. Also, in the first 
case the degree of P,, is 1 and in the second case it is 2. Thus, the first two 
assertions follow from Proposition 14.1, part (b), and Lemma 14.3. 

In the third case, there are two primes above oo, P,, and P!,. Thus, the 
unit group B* has rank 1. Let e be a generator of B* modulo torsion, i.e., 
B* =F* <e>. Ife’ denotes the Galois conjugate of e, then ee’ € F* which 
implies ordp,,¢e + ordp: e = 0. Thus we can chose P,,. to be the prime over 
oo with ordp..e < 0. Both primes above oo have degree 1, so by Lemma 
14.3, [D(S)° : P(S)| = RY = |log, |elp,|. By our choice of P,. we can 
remove the absolute value sign. Now, invoking Proposition 14.1 once again 
gives the result. 


Remark. It is worth pointing out that the expression log, |e|p_, can be 


considerably simplified. Let e = g + h,/f(T), where g,h € F[T]. Then 
e+e’ = 2g, which implies ordyg = ordp,.g = ordp,,(e + e’) = ordp_e = 
— log, |e|p,,. Since ordg. g = —deg g, we arrive at the simple equation 
hy = hpdeg g. 


Now let’s consider briefly the cyclotomic function fields treated in Chap- 
ter 12. Recall that K,, is defined to be k(A,,) where A,, are the m-torsion 
points on the Carlitz module. The ring O,, is the integral closure of A in 
Ky. Let S;, be the set of primes in K,, lying over oo. Then, as we have 
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seen, O,,, is the ring of S,,-integers in K,, and its unit group is the group 
of S,,-units. What is the cardinality of S,,? The answer is implicitly given 
in Theorem 12.14. The fixed field of {o, | a € F*} is denoted by K>. Ac- 
cording to that theorem, oo splits completely in K;, and each prime above 
oo in K;> ramifies totally in K,,. Let S* denote the primes in K;, lying 
above oo. It follows that 

O(m) 


Sal = [Sm] = [Bin 2B] = 


From this information it also follows that each prime in K,, lying above oo 
has degree 1. Thus, 


ti * * * Tx &(m) 
Proposition 14.8. The groups O*, /F* and O*,"/F* are free of rank a 
1. Moreover, hx, = ho, RY and hye = hor RY ; 


Proof. With the information already provided, the proof is a straightfor- 
ward application of Propositions 14.1, 14.2, and Lemma 14.3. 


In Chapter 16 we will investigate the class numbers for quadratic and 
cyclotomic function fields in greater detail. A fundamental tool will be 
Artin L-functions and their properties in the special case where the Galois 
group is abelian. Some of this was already discussed in Chapter 9. We will 
provide a short review these ideas. 

Let K/k be an Galois extension of global function fields. The number of 
elements in the constant field of k, F, will be denoted by q (as usual). We will 
not suppose that k is a rational function field. Let G denote the Galois group 
of K/k. We suppose that G is abelian. If P is a prime of k and $B is a prime 
of K lying over P, then the decomposition and inertia groups, Z7(%8/P) 
and I(§B/P), are independent of $8 (because G is abelian). We denote 
them more simply by Z(P) and I(P). We recall that |Z(P)| = e(P)f(P) 
and |I(P)| = e(P). Here, e(P) = e($/P) and f(P) = f($%/P) are the 
ramification index and relative degree of $% over P. We also know that 
Z(P)/I(P) is cyclic, being isomorphic to the Galois group of the residue 
class field extension. 

If $/P is unramified, then the Artin automorphism (P, K/k) € G gen- 
erates Z(P) and is characterized by the congruence 


(P,K/k)w =w%P (mod $) , 


where w is any element of K integral at {B. 

Let P be any prime of k and x € G a one-dimensional character of G. We 
want to define x(P). If P is unramified in K, we set y(P) = x((P, K/k)). 
If P is ramified, suppose x(I(P)) 4 1. In this case we say that y is ramified 
at P and set y(P) = 0. If x(1(P)) = 1, then y is a character on G/I(P). 
Let the fixed field of [(P) be denoted by M. Then, Gal(M/k) = G/I(P). 
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Under these conditions, set x(P) = x((P,M/k)). We have now defined 
x(P) for any prime P and we define the Artin L-function of x to be 


L(w,x) = [| @-x(P)NP-”)*. 
PES; 

When y = xo, the trivial character, we see that L(w, Xo) = ¢e(w), which 
has a simple pole at w = 1 but is analytic everywhere else. If y # Xo, then 
L(s, x) is entire (as follows from the fact that it can be identified with a 
Hecke L-function; see Chapter 9). The following proposition is a special 
case of a more general result about Artin L-functions. Since we will use it 
so often, we provide the relatively simple proof. 


Proposition 14.9. With the above notations, we have 


Cx(w) = Ge(w) [] Lw,x) . (5) 

X#Xo 
Proof. By looking at the product decompositions on both sides we see that 
it is sufficient to prove the following “semi-local” identity for each prime P 


of k. 
[[a@-Np’) = [[a-x(P)NP™). (6) 
B|P xEG 
Let e = e(P), f = f(P), and g = [K : k]/ef. We see that g is the number 
of primes of K lying above P. The left-hand side of Equation 6 is thus 


(1—-NP-f¥)9 , 


We want to show that the right-hand side of Equation 6 is equal to this 
same expression. Note first of all that if x([(P)) #1, then 1-x(P)NP-* = 
1. Thus, the right-hand side is equal to 


T] G@-x(P)NP-’). 


x€G/I(P) 


As before, let M be the fixed field of I(P). Then, by definition, y(P) = 
x((P, M/k)) an f-th root of unity. Every f-th root of unity determines a 
unique character of the subgroup of Z(P)/I(P) generated by (P, M/k) and 
each such character will extend in g = [G: Z(P)] = [G/I(P) : Z(P)/I(P)] 
ways to a character of G/I(P). Thus, 


f-1 
I] G@-x(P)Ne) =][a-GNp-’) =(-NP 9. 
x€G/I(P) =0 


This concludes the proof. 


The proof is a little easier to see when P is unramified in K, but it is 
important to include all the primes in the definition of L(w, x). 
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Corollary 1. Suppose K/k is abelian and geometric, 1.e. that there ts no 
constant field extension. Then 


hk =he [| L(O,x). 
X#Xo 
Proof. By ‘Theorem 5.9, we have 
Lk(q-™) 
(L—q-”)(1—gi™) ° 
where Lx(u) € Zlu] is such that Dx (1) = hr. 
By the assumption that there is no constant field extension in K/k we 


can multiply both sides Equation 5 in the statement of the proposition by 
(1 —q~”)(1 — q'~”) to derive 


Lk(q”) =Lilg™) [] Lv, x) . 


X#FXo 


CK(w) = 


Now substitute w = 0 to get the result. 
Corollary 2. For y # xo we have L(0,xv) #0. 
Proof. This follows immediately from Corollary 1. 


Remark. From Corollary 1, we can infer that hy | hx. In fact, hx /hz is 
a rational number equal to J], L(0,x), which is in Z|¢,], where n = 
[K : k]. This is because L(s, x) for vy # Xo is a polynomial in g~* with co- 
efficients in Z[¢,,]. A rational number which is simultaneously an algebraic 
integer is a rational integer, which proves the assertion. By using formal 
properties of Artin L-functions, one can show in this way that for any finite 
extension K/k of global fields, the class number of k divides the class num- 
ber of K. This fact was first shown by M. Madan [1] using cohomological 
methods. His proof is actually much more elementary than the analytic one 
we have just sketched. 


For the remainder of the chapter, we will be concerned with finding a 
class number formula similar to that given in the above Corollary 1, but for 
the class number of the ring of S-integers rather than the group of divisor 
classes of degree zero. 

Let K/k continue to denote a geometric, abelian extension of global 
function fields with Galois group G. Let S denote a finite set of primes 
of k and S’ the set of primes of K lying above those in S. Let A Ck 
denote the ring of S-integers in k and B Cc K denote the ring of S’-integers 
in K. By Theorem 14.5, both A and B are Dedekind domains. Using the 
method of proof of that theorem, it is not hard to see that B is the integral 
closure of A in kK. We denote by hy and hg the class numbers of A and B, 
respectively. In the special case where k = F(T), S = {oo}, and A = FT] 
we have hy = 1. 
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We have previously defined S-zeta functions. We now define S-L-functions 
in an analogous way. Namely, for x € G, define 


Ls(w,x) = [[ Q-x(P)NPoY)* 


P¢s 


By Theorem 14.5 there is a one-to-one correspondence between the primes 
in S, not in S and the prime ideals of the ring A. Thus, it is natural to 
think of Ds(w,x) as the L-function corresponding to the ring A. We set 
Ls(w,x) = La(w,x) and work primarily with the latter notation. 

In the following proposition we will need the definition of the Artin con- 
ductor F(v) of a character y. Artin gave a definition in great generality. 
It applies even if the Galois group G is not abelian. In the abelian case, 
which is treated briefly in Chapter 9, F(x) is defined to be the minimal 
effective divisor F such that x is trivial on the ray modulo F, P*. Recall 
that P* is the group of principal divisors generated by elements a € k* 
such that ordp(a —1) > ordp¥F for all primes P in the support of F. ‘That 
some effective divisor F exists with the property that y vanishes on P” is 
part of the statement of the Artin reciprocity law, Theorem 9.23. It is then 
an exercise to show there is a unique minimal one with this property. 


Proposition 14.10. L4(w, x) is a polynomial in q~™” of degree d(x) where 


d(x) =2g-2+degF(x)+ S > degP. 
PES(x) 


Here, g is the genus of k, F(x) is the Artin conductor of x, and S(x) CS 
is the set of primes in S at which x is unramified (i.e., x(I(P)) = 1). 


Proof. From the definition of D4(w, x) we have 


(w,x) = T[ 0 -x(P)NP™) L(w, x) | 


Pes 


By a famous result of A. Weil [1], we know that D(w, x) is a polynomial in 
q~” of degree 2g — 2+ deg F(x). It remains to examine the factors of the 
product over the primes in S. 

If x is ramified at P we have x(P) = 0, so these terms do not contribute. 
If x is not ramified at P, we have y(P) 4 0 and sol—x(P)NP-Y = 
1—x(P)q7¥ 48, which is a polynomial of degree deg P in q~”. The result 
follows from this. 


Proposition 14.11. We have 
¢a(w) = Ca(w) T] La(w,x) - (7) 
XFXo 


Proof. This assertion follows immediately from the definitions and the 
method of proof of Theorem 14.9. The method there uses the semi-local 
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identity given in Equation 6. We simply use that identity for all primes not 
in S, take the inverse of both sides, and then multiply over all primes not 


in S. 


We want to use Equation 7 together with Theorem 14.4 to get a class 
number formula. An important first step is to find a formula for the order 
of vanishing of La(w, x) at w = 0. 


Proposition 14.12. Suppose x # xo and let m(x) denote the order of 
vanishing of La(w,x) at w = 0. Then, 


mx) =#{P ES | x(Z(P)) =I}. 


Proof. From the definition, 


La(w,x) = [] @—x(P)NP~’)L(w, x) - 
Pes 


Since L(0,x) #0 by Corollary 2 to Proposition 14.9, we see that m(x) is 
just the number of P € S such that y(P) = 1. This only happens when yx 
is unramified and is trivial on (P, M/k) (recall that M is the fixed field of 
I(P)). Since the Artin automorphism at P generates Z(P)/I(P) Cc G/I(P) 
these conditions are equivalent to x(Z(P)) = 1. 


We have now assembled all the background necessary to prove the main 
result of this chapter. However, we need one more piece of notation. For 
a character y of G and a prime P € S) we have defined y(P). We now 
extend this definition to divisors D € Dy. If D = ) > a(P)P € Dg, set 


x(D) =|] xP). 
P 


Theorem 14.13. (The Analytic Class Number Formula) Let K/k be a 
geometric, abelian extension of global function fields. Let S be a finite set 
of primes of k, A the ring of S-integers and B the integral closure of A in 
K. Set RY = RY) and RY = RY, where S' is the set of primes of K 
lying above those in S. Then 


hpRY? =haRY TJ Cy, 
X#Xo 
where Oo) 
(—1)™\ 
C = mx) ; 
aie XD) den) 
deg D<d(x) 


Here D runs over all effective divisors of k which are prime to S and of 
degree less than or equal to d(x) (defined in the statement of Proposition 
14.10). Alternatively, one can think of D as running through all integral 
ideals of A with dimp(A/D) < d(x). The number m(x) is defined above. 
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Proof. By Proposition 14.12 we have 
Lalw, x) = ewe) + O(w™O+) , (8) 


where c, is a non-zero constant. Combining this with Equation 7 of Propo- 
sition 14.11 and the assertion of Theorem 14.4 yields the following identity: 


hpRp = haRa I] Cy . (9) 
X#Xo 


We have set Rs: = Rp and Rs = Ry. The same process shows the following 
fact (which can also be proved directly), 


IS'} =|S|+ D> my). (10) 
X#Xo 


From Equation 8 we see that c, is the m(x)-th derivative of L4(w, x) 
evaluated at w = 0 divided by m(x)!. By Proposition 14.10, we know that 
La(w,x) is a polynomial in q~” of degree d(y). Thus, 


La(w,x) = S22) =O (Dyan 8? 
D D 


where the sum is over effective divisors prime to S and of degree < d(x). 
Thus, with the same restrictions on the sum we find 


1 qm 


1 
~ fe =-—_y _ m(x) 
Cy mol dw (La(w, X)|w=0 moo! Z x(D)(— deg D In gq) 


Notice that c, = (In q)™C,, where C,, is defined in the statement of 
the theorem. Combining this remark with Equations 9 and 10 yields 


hpRp =haRa(n q)*~* [] Cy. 


X#Xo 
where s’ = |S’| and s = |S]. 
The result now follows from the fact (see the remarks preceding Lemma 


14.3) that (In q)*-! RW = Rp and (In g)S 1 RY = Rag. 


In the number field case the situation is similar, but more complicated. If 
kK /k is an abelian extension of number fields we can again choose a finite set 
S of primes of k and form S-units, S-class groups, S-L-functions, etc. Here 
it is standard to include in S at least the primes which ramify in K and (1) 
the archimedean primes. The local factors at the non-archimedean primes 
look exactly like their counterparts in the function field case, 1—y(P)NP~°, 
and are handled similarly. On the other hand, the local factors at the 
archimedean primes involve the I’-function, and this adds another level of 
complexity. The use of the I-function in “the local factors at infinity” is 
seen most clearly in the famous thesis of J. Tate [1]. An exposition is found 


in Chapter XIV of Lang [4]. 
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Exercises 


1. 


10. 


Let S' be a finite set of primes in a global function field. In the def- 
inition of the S-regulator, Rg, we began by choosing a set of units 
{€1,€2,...,€s—1} whose cosets in E(S)/F* form a free Z-basis. Show 
that Rg is independent of this choice. 


. In the proof of Lemma 14.3, we defined a map !: D(S) — Z?. Prove 


the assertion that [Z : 1(D(s))] = [peg deg P. 


. Prove the last assertions of Theorem 14.5. Namely, prove that E(S) 


is the group of units of Og and that Clg is isomorphic to the ideal 
class group of Og. 


. Let F be a finite field of characteristic different from 2. Let f(T) € 


A = F|T] be a square-free polynomial and let B be the integral closure 
of Ain F(T)(,/f(T)). Show that B= A+ A,/f(T). 


. Prove Proposition 14.6 by considering the completion of k = F(T) at 


00, kg, and the extension of k,, generated by the roots of X?— f(T). 


. Let S be a finite set of primes in a global function field K. Suppose 


all the elements in S have degree 1. Show that hx = hsRY. 


. Let k = F(T) and let S = {Po, Po}, the set consisiting of the prime 


at 0 and the prime at oo. Show that Os = F/T,T~']. What is E(S) 
in this case? 


. (Continuation) Let f(T) € F{T] be a square-free polynomial of even 


degree whose constant coefficient and leading coefficient are both 


squares in F*. Show that both Pp and Py split in K = k(,/f(T)). 


. (Continuation) Let B be the integral closure of F(T, T~+] in K. Show 


that ; 
hpRY = 5 Dx(9)(dee g)* 


where the sum is over all polynomials g(T) with degg < deg f and 
g(0) #0. Here, x(g) means x of the divisor dupes dp (g)P. 


Redo the last three exercises under the assumption that S constitutes 
all the primes of k of degree l, ie, S={Py |aeF}U{P.}, where 
Py is the prime corresponding to the localization of F(T] at (T — a). 


lo 


The Brumer-Stark Conjecture 


This chapter is devoted to the explanation and, in special cases, the proof of 
a conjecture which generalizes the famous theorem of Stickelberger about 
the structure of the class group of cyclotomic number fields. This important 
conjecture, due to A. Brumer and H. Stark, is unresolved in the number 
field case. The analogous conjecture in function fields is now a theorem due 
to the efforts of J. Tate and P. Deligne. A short time after Deligne completed 
Tate’s work on this result, D. Hayes found a proof along completely different 
lines. We will give a proof for the cyclotomic function fields introduced in 
Chapter 12. We will do so by using a method of B. Gross which combines 
the approaches of Tate and Hayes as they apply in this relatively simple 
special case. The use of 1-motives, which is essential in Deligne’s work, will 
not be needed here. 

Before beginning, it will be useful to give an outline of this chapter. 
We start with some generalities about groups acting on abelian groups, the 
group ring and its properties, and a review of the orthogonality relations for 
group characters. After these preliminaries we will discuss Gauss sums and 
their prime decomposition in cyclotomic number fields. This culminates in 
the statement of Stickelberger’s theorem. We then formulate the Brumer- 
Stark conjecture for both number fields and function fields (i.e., for all 
global fields). For abelian extensions of the rational numbers Q we show 
that the Brumer-Stark conjecture is a simple consequence of the theorem of 
Stickelberger. Finally, we come to the main result of this chapter, the proof 
of the Brumer-Stark conjecture for cyclotomic function fields. The proof is 
in two parts. The first is a general result due to Tate which asserts, roughly 
speaking, that the generalized Stickelberger element annihilates the group 
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of divisor classes of degree zero (of a global function field). The second 
part involves determining the prime decomposition of a torsion point of 
the Carlitz module. Since this decomposition is implicit in the results of 
Chapter 12, we prove the second part first. We then conclude the chapter 
with a proof of Tate’s result. This proof will be somewhat incomplete, 
because it relies heavily on work of A. Weil. Weil’s results will be stated, 
but not proved, since this would require advanced methods of algebraic 
geometry. Accepting Weil’s results as given, Tate’s proof is very beautiful 
and ingenious. It can be described as a sophisticated application of the 
Cayley-Hamilton theorem of linear algebra. 


Let V be an abelian group which is acted on by a finite group G. In 
other words, we are given a homomorphism p: G — Aut(V). Given this 
data, there is a canonical way to make V into a module over the group 
ring Z[G]. Recall that the elements of Z[G] are formal linear combinations 
of group elements, )) <q a(a)o, with coefficients a(a) € Z. The addition 
of two such elements is done coordinate-wise. The product is given by the 
following formula: 


(S- a(o)o) (> b(r)r = > ( S- a(o)b(r) )+ . 


a€G TEG yEG oa,TEG 
oT=¥ 


With these conventions, let }),-¢a(a)a € Z[G] and v € V. Then define 


(S> a(e)7) (v) = Sao) plo)(v) , 


gEG o€G 


It is a simple matter to check that with this definition, V becomes a Z/G] 
module. 

It is cumbersome to write p(a)(v). We often accept p as fixed and write 
more simply p(c)(v) = ov. 

Another notational convention is worth mentioning. Suppose that the 
group operation in the abelian group V is written multiplicatively instead 
of additively. Then the group ring acts according to the following formula: 


(s- a(o)o) (v) = I] (av) , 


ocG aEG 


An example of when this notation is appropriate is the case where K/k is 
a Galois extension of number fields, G is the Galois group, and V is the 
ideal class group of K. 

It is often useful to generalize these notions by assuming that V is not 
just an abelian group, but a module over a commutative ring R. In this 
case, we assume p maps G to Autr(V), ie., that the actions of G and 
of R on V commute with one another. In this case the action of G on 
V extends to an action of the group ring R[G] on V in exactly the same 
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manner as outlined above in the case R = Z. Of course, the group ring 
R{G] consists of formal R-linear combination of group elements. Addition 
and multiplication are given by the same formulas as in the case of the ring 
Z\G}. 

Let f : G— R* be a homomorphism from G to the group of units of R. 
This is easily seen to extend to a homomorphism of rings from R[G] to R 
by means of the formula 


Oy a(o)o) = S- a(a)f(c). 


cEG oEG 


Conversely, if such a homomorphism of rings (more precisely, R-algebras) 
is given, then, by restricting to G, one gets a homomorphism of groups 
G— R*. 

Let’s now specialize somewhat. We will assume that G is abelian. Set 
|G| = n and suppose that n is a unit in R. Suppose further that R is 
an integral domain and that R* contains an element of order n. These 
assumptions are satisfied if R is an algebraically closed field of characteristic 
zero. If R is an algebraically closed field of characteristic p > 0 and p does 
not divide n, then, once again, both assumptions hold. 


Proposition 15.1. The group G= Hom(G, R*) is isomorphic to G. 


Proof. (Sketch) The proof is very simple in the case that G is a finite 
cyclic group. The general case is handled by use of the theorem that a 
finite abelian group is isomorphic to a direct sum of cyclic groups. See 
Lang [4] for details. 


Corollary. |G| = |G|. 


The elements of G are called characters of G and the groups G is called 
the character group of G or, sometimes, the dual group of G. 


Lemma 15.2. Let G be a finite abelian group ando € G,o F# e€, the 
identity element of G. Then there is ax € G such that y(o) # 1. 


Proof. (Sketch) Suppose x(a) = 1 for all x € G. There is a natural homo- 


morphism from G/(a) —> G which, under our assumptions, would be onto. 
This contradicts the corollary to Proposition 15.1. 


Proposition 15.3. (The Orthogonality Relations) Let G be a finite abelian 
group of order n. Ifo,7 € G, then 


(a ~ S> x(o)x(r) = 6(0,7) 


xEG 


where 5(o,7) =1 ifo =7 and is 0 otherwise. If x, € G, then 
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(6) — > x(o7 le) = 5(x,¥) 


where d(x,W) = 1 ifx =w and is 0 otherwise. 


Proof. To prove the first relation, let y € G and set T(7) = ) ca x(7). If 


7 =e, the identity of G, then clearly, T(e) =n. If y #e there isaweG 
such that ~(y) 4 1 by Lemma 15.2. We have, 


ONT) = (7) So x7) = SO) 7) = TO) . 


XEG xEG 


Thus, (#%(y) — 1)T(y) = 0 and so, T(y) = 0. In general, given 0,7 € G set 
y =o7'r and note that for all characters y, x(a~!T) = x(a71)x(rT). This 
proves the first relation. . 

The proof of the second relation is similar. Choose an element \ € G 
and set S(A) = digegX(o). If A = Xo, the trivial character (x.(o0) = 1 for 
all a € G), then, clearly, S(y.) =n. If A xo, then there is a rT € G such 


that A(T) 4 1. We have, 


A(r)S(A) = A(t) So A(o) = SS A(T) = 


aeEG ceEG 


0. In general, if y,w € G, set 


XHo)W() = XIo “1)y(o). The 
this 


Thus, (A(7T) — 1)S(A) = 0 and so S(A) 
A= x7 yp. Then, (7) = (x7"¥)(e) = 


second relation follows immediately from 


We have assumed that G is a finite, abelian group. For any finite group 
one can define irreducible characters and prove orthogonality relations 
which generalize those given in Proposition 15.3. We will have no need 
for this generalization in this chapter. We have discussed this situation in 
Chapter 9. The interested reader can find an elegant presentation of this 
topic in Serre [3]. 

Let V be an R[G] module and x € G. Define V(x) = {u EV | ov = 
x(o)v, Vo € G}. The R-submodule V(x) is called the y—th isotypic com- 
ponent of V. Under the assumptions on G and R that we have made, we 
will show that V is the direct sum of the isotypic components V(x), as xy 
varies over G. This useful result is proved using certain idempotents in the 
group ring R{G] which we will now define. 

Let x € G and define e(x) € R[G] by the following formula: 


| -1 
a S- x(o )o 
aE€G 


Since we are assuming that n = |G| is a unit in R, the formula does indeed 
define an element of R[G]. 
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Lemma 15.4. 

1) For allo € G, we have ce(x) = x(a)e(x) . 

2) For ally, € G, we have E(x je() = d(x, W)e(x) . 

3) re: E(x) =e, the identity of G (and, also, of R{G)). 

4) For x,y € G we have x(e(#)) = 6(x, Y). 

5) The set {e(x) | x € G} is a free R basis for the group ring RIG}. 
Proof. To prove part 1, let 7 € G and calculate 


ex) =7= oxo No = = T xlr)x(r)x(o™)ro = 


ceEG oeG 


To prove part 2, we use part 1 and the orthogonality relations as follows: 


e(x)e(w ~ S> x(a ~")oe(b) = 


” 5EG 
€ S- x(a" )¥(e) Je() = 6(x, Wey) . 
cEG 


The proof of part 3 is another application of the orthogonality relations. 
We calculate again 


YS ed = (=O xloo) 


xEG xXEG o€G 
= DEX aerv)o= Dave 
cE€G XEG 


The property 4 is just a restatement of the second orthogonality relation. 
To see this, note that 


(e(v)) = x(= YL Me 2) = Ewe x(o) = 500) 
ae€G 


Finally, to prove property 5 note first that by part 1, RiGIe(x) = Re(x). 
From this and part 3 we see that the set {e(xv) | x € fea) spans R|G] over 
R. The linear independence follows immediately from 2. 


For the sake of clarity, in the following proposition we restate the hy- 
potheses under which we have been operating. 
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Proposition 15.5. Let G be a finite abelian group of ordern, R an integral 
domain whose units, R*, contains an element of order n. Assume also that 
nisaunitin R. Let V be an R[G] module. Then V is the direct sum of tts 


tsotypic components V(x). In other words, 


Ve BD V(x) 


xEG 


Proof. To begin with, we claim that V(v) = e(x)V. If uv € V, consider 
€(x)v. By the first part of the above lemma, we see that ae(x)u = x(a)e(x)u 
for all o € G. This shows that e(x)V C V(x). If vu € V(x), then 


e(x)v = + >, (x(o7)o)u ss 


oeEG 


~(S> x(o7 1) (0) Juv. 


oEG 


This shows V(x) € €(x)V, so our claim is proved. We have also shown that 
é(x) acts as the identity on V(x), a fact which we will use shortly. 

From the above Lemma, part 3, we see that for allu Ee V,v = 0 eg e(x)u. 
This shows that V is the sum of its isotypic components. It remains to show 
that the sum is direct. Suppose that for each x € G we have an element 
vy € V(x) and that > 4 vy, = 0. Then, for each 7) € G we have 


xEG 
0 = e($)(D~ vy) = Do e(p)vy = 
xEG xXxEG 
Dd e(Pe(x) vx = e(h)oy = vy 
xEG 


We have used part 2 of Lemma 15.4 and the fact that e(y) acts like the 
identity on V(x). This completes the proof. 


We have now presented all that we shall need from abstract algebra. Our 
next goal is to recall the relevant definitions and state the classical theorem 
of L. Stickelberger on the prime decomposition of Gauss sums. The details 
of this development and the proofs can be found in Ireland and Rosen [1]. 
Other sources are Lang [6] and Washington [1]. 

For every positive integer m let ¢,, denote the complex number em. Let 
Km = Q(¢m) and denote by D,, the ring of algebraic integers in K,,. Dm 
is generated, as a ring by Cn, i.e., Dn = Z[¢,,]. We can assume that m # 2 


m+2 
(mod 4), since if m =2 (mod 4) then Gn 2 = C4, and Gm = —Cnjg and so 
Km = Km2. With this convention, a prime p € Z is ramified in K,, if and 
only if p|m. 
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Assume that p € Z is a prime which does not divide m and that P C 
Dm is a prime ideal lying above pZ. D,,/P is a finite field with NP 


p! elements, where f is the smallest positive integer such that p’ = 1 
(mod m). If a ¢ P there is a unique integer i such that 0 <i < m and 
NP—1 


a m= =C (mod P). 


We set (@/P)m = C1, and call (a/P)m, the m-th power residue symbol. If 
a € P, we set (a/P)m = 0. The m-th power residue symbol has a number of 
arithmetically interesting properties. For our purposes, the most important 
is that a + (a/P),, induces a homomorphism from (D,,/P)* > (Gm), i-e., 
a character of the multiplicative group of D,,/P. Let Trp be the trace map 
from D,/P to Z/pZ and define 


(P)= DY) (a/P)AGrre™. 


a€(Dp/P)* 


The quantity g(P) is called the Gauss sum associated with the prime ideal 
P. We further define 6(P) = g(P)™. These quantities possess the following 
properties 


Proposition 15.6. 
1) g(P) € ACms Sp). 
2) ®(P) € Qn). 
3) |g(P)° = NP. 


The proof of part 1 is immediate from the definition. Part 2 is somewhat 
surprising. The proof uses Galois theory. Part 3 is a standard property of 
Gauss sums. For details see Ireland and Rosen [1], Proposition 14.3.1. 


The goal we are after is the prime decomposition of ®(P) in D,, where 
P is any prime ideal not containing m. From part 3 of Proposition 15.6. we 
deduce that 6(P)®(P) = NP™ = p!™, It follows that the primes which 
divide (®(P)) are primes in D,, lying over pZ. Since Q(¢m) is a Galois 
extension, the primes above pZ are all conjugates of P. We thus take a 
moment to recall the explicit description of the Galois group of Q(¢m) 
which we gave in Chapter 12. 

If ¢ € Z is relatively prime to m, there is a unique automorphism o; in 
Gm = Gal(Km/Q) with the property o1(¢m) = ¢¢,. The map t > o; gives 
rise to an isomorphism (Z/mZ)* = Gin. 

We can now state 


Theorem 15.7. (L. Stickelberger): 


m—1 m—1 


(O(P))= |] (oy "Py! 


t=1 t=1 
(t,m)=1 (t;m)=1 


| 
——, 
oh 
Q 
+ | 
—y 
Ne 
ae) 
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This theorem dates from the 19-th century. In the case where m is a 
prime it was formulated and proved by E. Kummer (1847). Stickelberger’s 
generalization came 43 years later (1890). 

We have seen that the prime decomposition of ®(P) involves only primes 
above pZ and that these are all conjugates of P under the action of the 
Galois group G,,. This is quite elementary. The remarkable feature of Stick- 
elberger’s theorem is that the same element of the group ring Z[G] describes 
the prime decomposition for all P not containing m. If we use the known 
fact that every element of the class group Clx,, contains infinitely many 
prime ideals, we derive the following important corollary. 


Corollary. The element 


m—l1 


NS) top) € Z[Gmn] 
t=1 


(tsm)=1 
annihilates the class group Clr. 


For the proof of Stickelberger’s theorem and some of the many important 
applications, see Ireland and Rosen [1], Lang [6], and/or Washington [1]. 


The goal of the Brumer-Stark conjecture is to generalize the above results 
to an arbitrary abelian extension of global fields K/k. If G = Gal(K/k), 
we are looking for an element of Z[G] defined in some canonical way which 
annihilates the class group of K in the number field case and the divisor 
class group of K in the function field case. This canonical element should 
essentially be the one given in the above corollary when K = K,, and 
k = Q. Brumer (unpublished, but see Coates [1]) was the first to suggest a 
candidate for such an element. We now describe the background necessary 
to write this down. 

Let K/k be a finite abelian extension of global fields of degree n, and 
G its Galois group. Let S be a non-empty finite set of primes of k which 
contains all the primes which ramify in K and, in the number field case, 
all the archimedean primes. If y : G — C* is a complex valued character 
on G we defined the S-L-function, Ls(w, x), in Chapter 14 as follows: 


Ls(w,x) = [[G-x(P)NP™”) = [[ (G-x(P)NP~”) L(w,x), 
P¢s Pes 


P non—arch 


where L(w, x) is the complete Artin L-function attached to x. 

For the rest of this discussion the ring R will denote the ring of complex- 
valued meromorphic functions on the complex plane. It satisfies all the 
hypotheses we need; it is an integral domain, it contains n n-th roots of 
unity, and n is a unit in R. 
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Definition. The L-function evaluator 0%/,%,9(w) € R[G] is defined as fol- 
lows: 
6(w) = Ox k,s(w) = >> Ls(w, X)e(x 
xEG 


Proposition 15.9. For ally € G we have x(O(w)) = Lg(w, Xx) . 


Proof. This is an immediate consequence of the definition of 6(w) and 
Lemma 15.4, part 4. 


This proposition explains the designation of @(w) as the L-function eval- 
uator. 

One can rewrite the definition of @(w) in terms of partial zeta functions. 
For o € G the definition of the partial zeta function ¢s(w,a) is given by 
the sum 

(s(wjo)= So ND. 

D,(D,S)=1 

(D,K/k)=o 
Here, in the function field case the sum is over all effective divisors whose 
support contains no prime in S and whose Artin symbol, (D, K/k), is equal 
to o. In the number field case the sum is over all integral ideals in the ring 
of integers of k which are prime to S and for which the Artin symbol, 
(D, K/k), is equal to o. 

In all cases, the sum is absolutely convergent in the region R(w) > 1 
and all these functions can be analytically continued to the whole complex 
plane with at most one simple pole at w = 1. The facts are reduced to 
known properties of zeta and L-functions by the following proposition. 


Proposition 15.10. With the above definitions and notations we have 


Ls(w,x) = >> x(e)6s(w, 0) (1) 


cEG 
and 
~ J x(a) x(7) Ls (w, x). (2) 
"eG 


Proof. From the definition of Ds(w,x), we find (summing over effective 
divisors or over integral ideals, prime to S) 


D,K/k 1 
yD Exe) ope 


(D,S)=1 oEG (D,S)=1 


(D,K/k)=o 
S— x( xX(7)Cs(w,o) . 
aéG 


Ls(w, x) 


266 Michael Rosen 


This shows that the sum defining ¢s(w, c) is, essentially, a subsum of that 
defining Ds(w,x). The latter sum is absolutely convergent in the region 
R(w) > 1 and so the sum defining ¢s(w, a) is absolutely convergent in this 
region as well. 

To prove Equation 2 simply choose a 7 € G, multiply Equation 1 on 
both sides by x(77~!)/n = x(7)/n, sum over all x € G, and use the first 
orthogonality relation. 

From Equation 2 we see that ¢s(w,a) can be analytically continued to 
the whole complex plane and is holomorphic everywhere except for a simple 
pole at w = 1 (corresponding to the simple pole of Lg(w, x.) at w = 1). 


We can now give the promised alternate expression for the L-function 
evaluator 0(w). 


Proposition 15.11. 


O(w) = Ox/x,s(w) = > Cs(w,o)o™ . 


GEG 


Proof. Define 6(w) = deca $s(w,a)a7' € ml By Equation 1 of the 


previous proposition, we find that y(@ Ae )) = Ls(w,X) (we have used 
x(o~') = x(c)). It follows that x(0(w) — ae) )) = 0 for all y € G. As 
we will see in a moment, this implies a ) = Ow). 


Suppose f € RIG has the property that x(f) = 0 for all vy € G. Write 
f =o rye(x) with r, € R (that this is possible follows from Lemma 15.4, 
part 5. Let » € G and apply w to both sides of this equation. We find 

= ry (by Lemma 15.4, part 4. Since this is true for all w € G it follows 
that f = 0. 


The values of the partial zeta functions ¢s(w,a) at w = 0 are especially 
important. It turns out that they are rational numbers and we have good 
control of their denominators. More precisely— 


Theorem 15.12. 


(a) Cs(0,0o) EQ. 


(0) WCs(0, 0c) EZ. 


where Wx denotes the number of roots of unity in K. 


This theorem is quite deep. Part a was first proved, in the number field 
case, by C.L. Siegel [1] and part b was first proved, in this case, by P. Deligne 
and K. Ribet [1]. Other proofs of both results appeared soon thereafter, 
e.g., by D. Barsky and by P. Cassou-Nogués. Part a remains true when 0 is 
replaced by a negative integer —n and part b remains true if we replace 0 
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by —n and Wx by wi) an integer also defined in terms of roots of unity. 
For details and applications see the instructive article of J. Coates [1]. 

We will give a proof later in the function field case using the fact that 
for non-trivial linear characters, x, Ds(w,x) is a polynomial in q~”. 
Definition. Define OK/k,S = Ox /k,s (0) and WK/k,S = WOK /x,8 . The 
element w/,,5 is called Brumer element of K/k relative to S. 


It follows from Proposition 15.11 and Theorem 15.12 that 0x74, € Q|G| 
and that wx/%,9 € Z[G]. For the most part we will fix the abelian extension 
K/k and the non-empty set of primes S, so we will call these elements 
simply @ and w. 

We are now in a position to state the Brumer-Stark conjecture in both 
number fields and function fields. 


The Brumer-Stark Conjecture (The Number Field Case). We suppose 
that |S| > 1. Then, for every fractional ideal D of K we have wD = 
(ap) where ap € K* and ap has absolute value 1 at all archimedean 
primes. Moreover, if Ap is aWx-th root of ap, then K(Ap)/k is an abelian 
extension. 


Since the divisor of ap is determined, ap is determined up to a unit in 
O}. The supplementary restrictions on ap insure that it is well defined up 
to a root of unity in K. 


The Brumer-Stark Conjecture (The Function Field Case). Suppose 
first that |S| > 1. Then, for every divisor D of K, we have wD = (ap) 
with ap € K*. IfAp is a WK-th root of ap, then K(Ap)/k is an abelian 
extension. If S = {$3}, then for every divisor D of K, there is an integer 
np € Z and an element ap € K* such that wD = (ap) + ND diggin B. 
Once again, if Ap is any Wx-th root of ap, then K(Ap)/k is an abelian 
extension. 


In both the number field and the function field case, it is easy to see that 
the conditions imposed on ap determine it up to multiplication by a root 
of unity. The same is true for Ap. The question of whether K(Ap)/k is or 
is hot abelian is not affected by this ambiguity. 

In both versions, the conjecture that w annihilates the class group is due 
to Brumer and the conjecture that K(Ap)/k is abelian is due to Stark. 

We now show how Stickelberger’s theorem implies the number field ver- 
sion of the Brumer-Stark conjecture for cyclotomic extensions of Q. 

Suppose m is a positive integer which is either odd or divisible by 4 and 
consider the cyclotomic field Ky, = Q(Gm). Let S be the set of primes 
dividing m together with the archimedean prime of Q. The first task is to 
compute the element w = ws x, /g. 

Let t € Z be relatively prime to m and 1 < t < m. Let o; be the 
corresponding element of Gal(K,,/Q). As is easily seen, ifn > 0 is relatively 


268 Michael Rosen 


prime to m, on = ((n), Km/Q) = o; if and only if n =t (mod m). Thus, 


~ 1 = 1 
Cs(w, or) = a nw = d (t+ hm)” . 


n=t (mod mm) 


For any real number b with 0 < b < 1, the Hurwitz zeta function is 
defined by the formula 


1 
(uy 8) =, (b+h)¥ 


It follows that Cs(w,o1) = m~“¢(w, t/m). It is a well-known property of 
the Hurwitz zeta function that for every integer n > 1 we have ¢(1—n, b) = 
—B,,(b)/n, where B,,(b) is the n-th Bernoulli polynomial. A good source 
for this is Washington [1] or Lang [6]. For n = 1 we have By(b) = b — 5. 
Putting all this together yields 


1 t 
C5 (0, +) = 5 _ m ) 
and so 
~~ sl t 
= EG 
> 2 m/ * 


Assume first that m is odd. Then, Wx, = 2m and so 


m—1 m—1 
w=Wr O= (m — 2t)op' =mN — 2 S- top. 
t=1 t=1 
(tym)=1 (t,m)=1 


Here, N=) 0 ,¢qg¢ is the norm map. 
Let P be a prime of K,, which is prime to m. Then, using the explicit 
expression we have just derived for w and Stickelberger’s theorem, we find 


“= (Sp) = Gp) 


This verifies the first part of the Brumer-Stark conjecture when D = P is 
a prime ideal which is prime to m with ap = NP™/g(P)?™. By Proposition 
15.6, part 3, ap has absolute value equal to 1. It is easily checked, by 
using the Galois properties of Gauss sums, that every Galois conjugate 
of ap also has absolute value 1. This verifies the second condition of the 
conjecture. Finally, since Wx = 2m in the case we are considering, we 
find that Ap = NP1/2/g(P) so that Km(Ap) © Qn, Sp, VNP) which is 
abelian over Q. If m is odd, the full Brumer-Stark conjecture for any divisor 
D prime to m follows from this. 


15. The Brumer-Stark Conjecture 269 


When ™ is even and divisible by 4 we have Wx,, = m. The proof in this 
case differs insignificantly from the case we have considered. We leave the 
details as an exercise. 


We now turn our attention to the function field case. To ease the expo- 
sition we will restrict our attention to abelian extensions K/k which are 
geometric; i.e., both K and k have the same constant field, F, which is a 
finite field with q elements. Under this condition, group of roots of unity 
in K is just F*, so that We =q-1. 

Our first task is to consider more closely the L-function evaluator 6(w) = 
Ox /n,9(w) (we fix an abelian extension K’/k of degree n and a finite set of 
primes S' of k which contains all the ramified primes). In the function field 
case, all the S-L-functions which occur are rational functions of u = q~”. 
We write Ls(w, x) = Lg(u, x), ¢s(w,o) = Cs(u,o), and 0(w) = 6(u). From 
Proposition 15.10, Equation 2, we find 


és(u,2) = = D> x) Es(usx) 3) 


XxEG 


Let E = Q(¢,). All the characters in G have values in E. It follows 
from Theorem 9.24, and the Artin reciprocity law (Artin L-functions can 
be identified with Hecke L-functions), that for x non-trivial Lg(u, x) is a 
polynomial in u with coefficients in FE. If vy = xo, the trivial character, then 


Ls(u, Xo) — I] (1 — uses P) Zr (u) = I] (1 _ utes P) Lx (u) 


Pes Pes 1—u)(1— qu) 


Since S' is non-empty by assumption, it follows that (1 — qu)Lg(u,Xo) € 
Zu}. 

It follows from all this and Equation 3, that (1 — qu)@(u) € E/ul[G]. We 
claim that it is actually in Z[u][G). 

To see this, note that from the definition of the partial zeta functions we 
have 


Cs(w, 0) = » —— = So uS®? = Cs(u,o) . 


D, (D,K/k)=o D, (D,K/k)=o 


It follows from this and Proposition 15.11 that (1 — qu)6(u) € Z[ful][G]. 
Since E/u] N Z[[u]] = Zlu], we have proved— 


Theorem 15.13. Let 6(w) = 6(u) = Ox /k,g(U be the L-function evalua- 


~ 


tor. Then, (1 — qu)0(u) is an element of Zlu|[G|. Evaluating at u = 1 we 


~ 


have (q — 1)@ = (q —1)0(1) € ZIG). 


The only point which perhaps needs some explanation is the last asser- 
tion. Recall that wu = q~™. It follows that 6 = 6(0) = @(1). 
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Since, as we have already pointed out, Wx = q—1 under our hypothesis 
that K/k is a geometric extension, we see that Theorem 15.13 is a strong 
function field version of Theorem 15.12. 

As before, set w = (q¢ — 1)@ € Z/G]. We can now state the theorem of J. 
Tate mentioned in the introduction to this chapter. 


Theorem 15.14. Let K/k be a geometric abelian extension of global func- 
tion fields with Galois group G. Let w = (¢q—1)@ € ZG] be the Brumer 
element defined above. Then for every divisor D of K of degree zero, we 
have wD = (ap), a principal divisor of K. In other words, w annihilates 
the group of divisor classes of degree zero, Cl. 


This theorem proves a big piece of the Brumer-Stark conjecture in the 
general case. We will give the proof at the end of the chapter. Our next 
task is to use this result to prove the full Brumer-Stark conjecture for the 
cyclotomic function fields K,, = k(Am) and K7, = k(A,,)t which were 
defined and investigated in Chapter 12. Note that K,, now denotes the 
cyclotomic function field generated by adding the m-torsion on the Carlitz 
module to the rational function field k = F(T’). Here m is a non-constant 
monic polynomial of degree M in the ring A = FIT). 

The sets S and S* corresponding to K,,/k and K;,/k will consist pre- 
cisely of the ramified primes. Thus, S = {P | P|lm}U {oo} and St = 
{P| P\m}. We recall that oo is ramified in K,, and splits completely in K;7, 
(see Theorem 12.4). We wish to calculate 6 = 0x, /~,5 and — Dict /k,S+: 


Proposition 15.15. With the above definitions and notations we have 


1 
__ y } —1 
(a) 6= Oa eo 


a monic 
dega<M, (a,m)=1 


(b) gt = S- (M — dega —1)o,;! - —— 
deg a<M, (a,m)=1 


In the first equation, N = DoE Gal( Kun /k) ao, and in the second, Nt = 


ac cCal(Kt /k) o , 2.e., the norm maps. 


Proof. Recall that Gal(K,,/k) = {oq | (a,m) =1 and dega < M}. Here 
O, is the unique automorphism with the property that og(A) = C(A) for 
all A € Am. In fact, this condition defines a, for any a € A with (a,m) = 1. 
We have o, = 0» if and only if a = b (mod m). Moreover, ((a), Km/k) = oa 
if and only if a is monic. For all this see Chapter 12. 

Since S consists of the primes dividing m and oo, in the definition of the 
partial zeta function we sum over effective divisors relatively prime to m 
with no component at oo. This is the same as summing over ideals in A 
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which are prime to m. Every ideal D has a unique monic generator d and 
ND = |d| = q?@¢. Thus, assuming a is monic, we have 


(swio)= > ype= Da 


(D,S)=1 d monic, (djm)=1 
(D,Km/k)=ea T4=Ca 
=|al-" + SO lat aml’ = fal’ +|mj-" Sal 
hea h monic 


h monic 
= jal~" + |m| Togise | 

If a is not monic, the calculation is exactly the same except that the 
term |a|~” does not appear. Thus, ¢s(0,¢.) = 1—(q—1)~' if a is monic 
and ¢s(0,0,) = —(¢—1)~? if a is not monic. The expression for @ given in 
part a of the proposition follows immediately from these results. 

Recall that K7* is the fixed field of {og | a € F*}. It follows that 
Gal(K7/k) = {a4 | (a,m) = 1 and dega < M and a monic}. Here we 
are identifying og with its restriction to K;{. As automorphisms of K;, we 
have og = 0, if and only if d= aa (mod m) for some a € F*. 

Since St consists only of primes dividing m, in the definition of the 
partial zeta we sum over all effective divisors of the form D = Dy + ico, 
where Dy is an effective divisor prime to m and oo and 7 is a non-negative 
integer. As before, Dy corresponds to an ideal of A with a monic generator 
d which is prime to m. 

Since oo splits completely in K+, we have (oo, Kj,/k) = e. Thus, for a 
monic we have 


1 = 1 
cse(wioa)= pw 2D ND pay 
(D,S*)=1 i=0 (D,,Kt,/k)=00 

(D, Kin /k)=00 
Now, N(Dy + too) = ND;N(oo)* = |d|q’. Thus, we can rewrite this ex- 


pression as 


Co+ ( (w Oa) -S- S- \d|~ w go _ —— S- \d|-” 


—q 
1=0 d monic d monic 
Cg=Oo Cd=Oe 
Here, d runs over monic polynomials prime to m with og = og. As we 
have seen, the latter condition holds if and only if d = aa (mod m) for 
some a € F*, which is equivalent to the condition a~'d = a (mod m). In 
other words, we can sum over all d € A (not just the monics) with d= a 


(mod m). Thus, 


dS la = Say = Jaf + YO Ja + hm 


d monic de€A hEA 
T4=Ja d=a (mod m) h#0 
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—w —w — —w q—1 —w 
=|al- + (q—1)|m|-” SO Al-” = a + pew 


h monic 
Putting all this together, we find 
Cs+(W, 0a) = (1-97 ¥) “(al + (g— YL g'*) m|™) 
(1 ~ qu)ui*e* + (g — 1)usee” 
(1 — u)(1 — qu) 
As usual, we have substituted u = q~™” and simplified somewhat. We need 
the value of this function at w = 0 or what is the same, at u = 1. If we 
substitute wu = 1 into the above expression, we find that both numerator 


and denominator vanish. Invoking L’H6pital’s rule, we differentiate both 
numerator and denominator and then substitute u = 1. The result is 


x 1 
C5+(0,0a) = C5+(1, 04) = degm — dega—1—- qo1 


From this the proof of part b of the proposition is immediate. 
Define 


_ —1 
n= ) oo,’ and nt = ) (M —dega—l)og’ . 
a monic a monic 
dega<M, (a,m)=1 dega<M, (a,m)=1 


We can now write 9 = 7 — (q—1)7!N and 6+ = nt — (q—1)7'NT. 
Also, for the Brumer elements we have w = (¢—1)0 = (¢—1)n—WN and 
wt = (q—1)n* — N°. This method of writing things will be of importance 
to us because of the following result of B. Gross [1]. 


Proposition 15.16. The element 7 annihilates ClR and the element nr 
annihilates Cl + ; 


We will prove this later as a corollary to the proof of Theorem 15.14. 

The last ingredient we will need is the prime decomposition of a primitive 
m-torsion point on the Carlitz module. The miraculous thing that happens 
is that this decomposition is essentially given by the Brumer element wT. 


Proposition 15.17. Let Bq be a prime of Km lying over «~ ink. There 
exists a primitive m-torsion point A € Am such that 


(A) = ((a—1)nT — 7) Boot Bm - 


The element \9~1 is in K+. As an element of Kt, its prime decomposition 
is given by 

(AT*) = wT BS + Br - 
Here, B,, is the unique prime of Km lying above P ifm = P® is a prime 
power and is the zero divisor otherwise. B, is the prime of K7, lying below 
Bn. Finally, PI is the prime of K>, lying below Boo. 
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Proof. Let kx be the completion of k at oo and let kg be its algebraic 
closure. Let ord,, denote the normalized additive valuation of k,, extended 
to kg in the usual way. Finally, let 0: K,, > ko be an embedding and let 
‘Bo be the corresponding prime of Ky». 

Using the results of Chapter 12, in particular Proposition 12.13 and 
Theorem 12.14, we see there is a primitive m-torsion point for the Carlitz 
module, A, such that ord.(tog\) = M—dega—1—(q—1)7} foranyae A 
relatively prime to m and with degree less than M. Since $8., is ramified 
over k with ramification index g — 1, we can write this as 


ord -1g, (A) = ord, (2A) = (¢—-1)(M —dega—1)-1. 


The decomposition group of Bo is {aq | a € F*}. It follows that the set 
of distinct primes above oo in K,, is {o7'$8. | a monic and dega < M}. 
We recall Proposition 12.7, which shows that if m = P*® is a prime power 
there is exactly one prime ideal above P in O, C Km and it is totally 
ramified and generated by ». Otherwise, is a unit in O,, by the second 
part of Proposition 12.6. It follows that the prime decomposition of the 
divisor (A) is given by 


N= DP ((g = 1)(M = dega — 1) ~ 1) oo Boo + Bm 
deg a<M, (am)=1 

From this and the definitions of 7 and 7* we get the first assertion. 

All the primes {o7'8., | a@ monic and dega < M} are totally and tamely 
ramified over K;> of ramification index q—1. The same is true of %,, when 
it is non-trivial. The second relation follows easily from these remarks, the 
first relation, and the fact that wt = (¢—1)nt — N7. It is also helpful to 
notice that 7 restricted to K+ is NT. 


We have now assembled everything we need to prove the Brumer-Stark 
conjecture for K,,/k and K> /k. 


Theorem 15.18. Let k = F(T), Km = k(Am), and KX = k(A,)t, 
the maximal real subfield of K,,. The Brumer-Stark conjecture is valid for 


Kin/k and Ky /k. 


Proof. Let D be any divisor of K,,. Since % has degree 1 we can write 
D = Do +t Boo where t = deg D and Do has degree zero. Since the decom- 
position group of Bq is {gq | a € F*} we see that NBo = (¢q —1)n Bo. 
Thus, 
WB oo = ((¢—1)n — N) Poo = NBo-NPo = 0. 

From this and Theorem 15.14 we see that wD = wDo = (ap) for some 
ap € K;. This proves the first part of the Brumer-Stark conjecture for 
Kin /k. 

To prove the second part we make use of Gross’s result, Proposition 
15.16. From this we know that 7Dp is already principal. Set 7Do = (Gp). 
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Notice, also, that N Do = (d) where d € k*. This follows from the fact that 
Cl is trivial. Therefore, 


wD = wDo = (¢—1)nDo — NDo = (85 *) — (4) = (66 *d7") . 


We see that we can choose ap = vid! and so the field generated by 
Ap = Yap over Ky, is the same as the field generated over Ky, by IVa. 
Now, k( *~V/d) /k is a Kummer extension and consequently a cyclic exten- 
sion of fields. Thus, K,,(Ap) is the composite of two abelian extensions of 
k, namely, K,, and k( 7d), and so is itself an abelian extension of k. This 
completes the proof for the case K,,/k. 

Now consider the case K+ /k. Once again, any divisor D of K;, can be 
written in the form Dp +t where t = deg D. By Theorem 15.14, we 
find wt Do = (ap,) is principal. From Proposition 15.17, we have wT BX = 
(A\7-!) — B+. Thus, 


wtD = (ap, A°97 V4) —~ ter , 


which verifies the first part of the Brumer-Stark conjecture for K;7,/k. 
To prove the second part of the conjecture we use Proposition 15.15 once 
more to deduce that 7* Do = (Gp,) is principal. It follows that 


wt Do = ((q-1)n* — N*)Do = (8h,'d7') , 


where d € k* is such that NtDp = (d). Thus, we can choose ap, = 
BE d=} and so 


wt D = (681d 1NOYY) — BS, 


We can set ap = BEd AGE, From this we see that \, which is the 
q—1 root of ap generates the same field over K,, as “Wd. Thus, K+(\}) 


is contained in K,,( 7d), which is abelian over k as we showed in the first 
part of the proof. This completes the proof for K> /k. 


Remarks. 


1. We hope there is no confusion caused by the notation Ap for the ele- 
ment appearing in the statement of the Brumer-Stark conjecture and the 
element A, a primitive m-torsion point of the Carlitz module. 


2. For the reader who is familiar with the classical situation there may be 
some surprise that the Brumer element for K;, is non-trivial. The Brumer 
element for Q(¢m)*/Q is zero. This is because in this case S* contains 
the archimedean prime of Q and this splits completely in Q(¢,,)*. It can 
be shown in general that a prime in S which splits completely in K forces 
the Brumer element wx/;,5 to be zero. In the function field case, there are 
no archimedean primes. St contains only those primes dividing the monic 
polynomial m, all of which ramify in K;. 
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3. Using functorial properties of the Stickelberger element and Theorem 
15.18 on can show that if k is the rational function field and K C Ky» for 
some monic m € A, then the Brumer-Stark conjecture holds for K/k. 


We now begin to describe the background necessary for the proof of 
Tate’s theorem, Theorem 15.14. 

As we have seen, in the function field case, when we describe everything 
in terms of u = q~”, all the functions in question are rational in u with 
coefficients in F = Q(¢,), where n = [K : kj. It will be necessary for us 
to work with characters whose values occur in the algebraic closure of Q). 
Here, / is an arbitrarily chosen prime in Z different from p, the characteristic 
of F. We write E; for a finite extension of Q; containing the n-th roots of 
unity. The same analysis given earlier shows that 


6(u) = So Ls(ux")e(x) = Yo Ss (ua)o™ . 


Now, of course, G = Hom(G, Ey‘), L(u,x) € E;(u), and e(x) € Ey[G]. It 
is still the case that (1 — qu)@ € Z[u|(G]. The necessity for these changes 
will become apparent in a little while. 

Let F be the algebraic closure of F, k = kF, and K = KF. Since K/k isa 
geometric extension, we have KNk = k. It follows that the Galois group of 
K/k is the direct product of Gal(K/k) and Gal(K/K). The first group is 
naturally isomorphic to G, so we will now think of G as automorphisms of 
K which leave k fixed. Let ¢ be the automorphism of K /K which induces 
the automorphism “raising to the q-th power” on F. This is called the 
Frobenius automorphism of the extension. Note that @¢ commutes with the 
elements of G as automorphisms of K. 

In Chapter 11 we introduced the notation J for the divisor classes of 
degree zero of K,i.e., J = Clg. The corollary to Theorem 11.12 gives the 
algebraic structure of J[N], the points of order dividing N on J. If pt N, 
then 


where g denotes the genus of K. 

Choose and fix a rational prime | 4 p and consider the groups J[I”]. It 
is clear that for each positive integer n, multiplication by | maps J[I"*+1] to 
J[{I”|. We define the Tate module, 7;(J) as the inverse limit of the groups 
J(I”] under these maps. It is possible to give a very concrete interpreta- 
tion of this group. Namely, the elements of T;(J) can be identified with 
infinite-tuples, (a1,@2,a3,...), where for all n > 0 we have a, € J[I”| 
and lans, = an. The Tate module is acted upon by the l-adic integers 
Z; in the obvious way; if a € Z and a = (aj,42,43,...) € T)(J), then 
aa = (aa 1,Qa2,aa3,...). Similarly, since G and ¢ act on each J[!”), these 
actions can be extended diagonally to an action on T;(J). Thus, T)(J) is a 
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Z(G] module with an action by ¢ which commutes with the action of Z(G]. 
Using the above structure theorem for J[N], one can show that T)(J) is a 
free Z, module of rank 2g. We set Vi = Vi(J) = Q; @z, Ti(J). V; is a vector 
space of dimension 2g over Q; with a natural action by Q;[G] and ¢ and 
these two actions commute. 

We need enough roots of unity in our coefficient field. To this end define 
V = E, gq, Vi. By Proposition 15.5, we have the following decomposition: 


V=S_ V(x). 


XEG 


Since the actions of ¢ and G commute, it is easy to see that each Ey- 
vector space, V(x), is mapped into itself by ¢. We need the following two 
results. 


Proposition 15.19. If a polynomial in ¢, f(¢) € Z[G||¢], vanishes on V), 
then it vanishes on J. 


Theorem 15.20. The determinant of 1— du acting on V, is the numerator 
of the zeta function of the field K, i.e., 


det(1 — gu)|vy, = Lx(u) . 


Suppose x € G, x # Xo. Let d(x) be the E,-endomorphism of V(x) induced 
by d. Then 


det(I — (x)u) = L(u, x71) 


For the trivial character, x,, we have 


det(I — 6(x0)u) = Le(u) 
where Ly(u) is the numerator of the zeta function of k. 


Proposition 15.19 is a consequence of a far more general result about 
geometric endomorphisms of abelian varieties. The point is that any such 
polynomial f(¢) can be thought of as an element of Endg(J), regarding J 
as an abelian variety over F. It is not a difficult result given the necessary 
background. Theorem 15.20, on the other hand, is a major theorem. It is 
due to Weil. The proof can be found in the original book of Weil [2]. A 
more modern exposition can be found in the article by J. Milne [1]. We will 
simply accept the result as true and deduce consequences. 

We now have everything we need for the proof of Theorem 15.14. We 
know by Proposition 15.13 that 


(1 — qu)6(u) = S_ (1 —qu)Ls(u,x~")e(x) € Z[ul[G]. 
xEG 


For each yx # x0, Ls(u,x) is a polynomial in EF, [u] which is divisible by 


a 


the Artin L-function L(u, x). For y = x. we know that (1 — qu) Ls(u, Xo) 
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is in Z[u] and is divisible by L,(u). From this, Theorem 15.20, and the 
Cayley-Hamilton theorem, we see that (1 — g@~!)6(¢~*) induces the zero 
endomorphism on V. Multiplying by a sufficiently large power of ¢, #™ say, 
we see that 


f(¢) = (1 — q71)6(7') € Z[G][GI 


is a polynomial in ¢ with coefficients in Z[G]. Since it vanishes on V, it 
vanishes on V;(J) and, by Proposition 15.19, it vanishes on J. Since ClZ = 
J(F) c J, we see f(¢) restricted to J(F) is zero. However, ¢ restricted to 
J(F) is the identity, so f(¢) restricted to J(F) is f(1) = (1—q)@(1) = —w. 
This shows that w annihilates J(F) = Cl?, as asserted. 


It remains to prove Gross’s result, Proposition 15.16. This will follow 
from the proof, just given, of Tate’s theorem. We need explicit expres- 
sions for the elements 6(u) and 6*(u) associated to the cyclotomic func- 
tion field extensions K,,/k and K;/k, respectively. These were implicitly 
constructed in the course of the proof of Proposition 15.15. 

For the extension K,,/k, we found that C(u,o,) = u%@*+(1—qu)~!utee™ 
if a is monic and (1 — qu)~!u%*é™ if a is not monic. Thus, 


y@ 


O(u) = ) ute rg ly 
a monic 
dega<M, (a,m)=1 


1 — qu 


We note that the norm map N induces the zero mapping on Vj, and 
thus on V, since Cl? = (0) (because k is the rational function field). As in 


~ 


the proof of Tate’s theorem, substitute d~! into (1 — qu)@(u) and multiply 
by #™ to obtain a polynomial in Z[G][¢] that annihilates V. Because the 
norm element annihilates V, we find that 


(¢ —q) S- gM—l-degag—l 


a monic 
dega<M, (a,m)=1 


annihilates V. The endomorphism of V induced by ¢ — q is invertible since 
its determinant is given by 


det(¢ — q) = det —q(1 - q‘¢) = q’9 LK (q_") , 


We have used the first part of Theorem 15,20. By the functional equation for 
the zeta function we find that the last quantity is a power of q times Lx (1), 
which is the class number of kK (see Theorem 5.9). Therefore det(@—q) 4 0. 
It follows that the element 


S- gM—l-dega,—1 E Z(G](¢] 


a monic 
dega<M, (a,m)=1 
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annihilates V. By Proposition 15.19, it annihilates J and so its restriction 
to J(F) = Cl% is the zero mapping. Since ¢ restricts to the identity, the 
restriction of this element is 7. We have proven Proposition 15.16 in the 
case K,,/k. 

The proof in the case K;/k is similar, but a bit more complicated. We 
sketch the proof and leave it to the reader to check the details. Recall that 
in the proof of Proposition 15.15 we showed 


“ao, _ (Logue + (q = 1)utee 
(ga) = ald — gu) 


Using this, and a little algebraic manipulation, we deduce the following 
identity: 


(1—qu)6(u) = (1—qu) > 


a monic 
dega<M, (a,m)=1 


uses e _ 1 1—qu+(q—1)u™” 


l—u 


—1 


On + Nt. 


l—wu 


All the rational functions of u occurring as coefficients are actually poly- 
nomials since the numerators vanish at u = 1. Now, substituting u = é7!, 
and following the same steps as in the first part of the proof leads to the 
conclusion that the following element annihilates Cli + 


S- —degaaz’. 


a monic 
dega<M, (a,m)=1 
However, this element differs from 7* by an integer multiple of the norm 
map, N*; so we find that 7* annihilates Cll + and the proof is complete. 


Having come this far, the reader who is interested in the proof of the 
Brumer-Stark conjecture in the general case for function fields has two di- 
rections to go. Learn the necessary background about 1-motives and read 
Deligne’s proof as presented in Chapter V of Tate’s monograph [1]. This 
proof does not involve Drinfeld modules at all. On the other hand, by learn- 
ing more about the theory of Drinfeld modules one can build up enough 
background to read the paper Hayes [5], which gives an elegant proof involv- 
ing no algebraic geometry beyond the Riemann-Roch theorem for curves. 
Hayes relies instead on the more advanced theory of Drinfeld modules. The 
“mixed” proof we have given here for the case of cyclotomic function fields 
should provide a good head start in either direction. 


Exercises 


In the following problems, K/k will denote a finite, geometric, abelian 
extension of global fields, G the Galois group of K/k, and S a finite set of 
primes of k which includes all those which ramify in K and, in the number 
field case, all the archimedean primes. We will often shorten the notation 
for the Stickelberger element 0% /, 5 to Og. 


15. The Brumer-Stark Conjecture 279 


1. If P¢ S, show that O5u;p} = (1 —op')Og where op = (P, K/k). 


2. For any prime P of k, let Np = DI oEZ(P) ao, where the sum is over 
all the elements of the decomposition group of any prime in K lying 
over P. Suppose P € S and that #(S) > 2. Show that Np@s = 0. 
Conclude that 65 = 0 if any prime in S' splits completely in K. Hint: 
Consider x(Np@s) for all v € G and use Proposition 14.12. 


3. Let K’ be an intermediate extension between k and K. Let G’ = 
Gal(K'/k) and m : G > G’ the natural map given by restriction. 
Show that ™ (OK /k,S) = OK! /k,S: 


4. Let e: Q|G] > Q be the augmentation map defined by e(}), r(a)o) = 
>, r(a). If #(S) = 2, show that (0s) =0. 


5. We showed in the text that (q — 1)@, € Z[G]. Show that (o —1)@s € 
Z|G] for every o € G,o # 1. Hint: If Og(u) is the L-function evaluator, 
show first that (o — 1)@s(u) is a polynomial in wu. 


6. In Deligne-Ribet [1], the authors show that for primes P which are 
unramified in K and do not divide Wx we have (op —NP)0g € Z|GI, 
where op = (P, K/k). Let D be any divisor (ideal) of K and assume 
the Brumer-Stark conjecture is true. Assume also that #(S) > 2. For 
P¢S,P+tWkr, and P ¢ Supp(D), show that (cp -NP)60sD = (ap) 
where ap € K*. Hint: By the Brumer-Stark conjecture, WxK0sD = 
(a), where a € K* and K())/k is abelian where AY* = a. Let 
oy = (P, K(A)/k). Show that (op -NP)6sD = ((op—NP)A), where 
both sides are interpreted as divisors in K(A). Let ap = (op—NP)A. 
Show that it suffice to prove that ap € K* and then prove this using 
Galois theory and the fact that op — NP annihilates ux. 


The next exercises are based on another conjecture of Stark, which is 
in turn a very special case of a broad class of conjectures on the value of 
Artin [-functions at zero. Let T be a finite set of primes in & such that 
all primes which ramify in K and all archimidean primes (if there are any) 
lie in T, #(T) > 2, and at least one prime P, € T splits completely in K. 
If #(T) > 3, define U\) to be the set of elements in K* which are units 
except possibly at the primes lying above P,. If T = {P,,Q} define U (0) 
to be all T-units u which satisfy |ulo = |u|eg for all o € G. Finally, let PB, 
be a prime in K lying above P,. 


Conjecture A. If 7 satisfies the conditions just stated, there is an element 
e, € U) such that 


1 
L'(0, x) = “We S_ x(o) log |oeolm, ; 
K gEéG 


for all x € G. Moreover, K( “x /€o)/k is an abelian extension. 
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The next exercises indicate how one can deduce the Brumer-Stark con- 
jecture from Conjecture A. 


7. 


10. 


11. 


12. 


Let S' be a non-empty, finite set of primes of k containing the primes 
which ramify in K and, in the number field case, the archimedean 
primes. Let P, be a prime of k which splits in K and set T = SU{P}. 
Prove that L/,.(0, x) = log(NP,)Ls(0, x). 


. Let P, be a prime which splits completely in K and $8, a prime of 


K lying above P,. Prove that ordy,a = — log |a|x, /log(N Po) for all 
aé K*. 


. Use Conjecture A, stated above, together with the last two exercises 


to deduce 


1 
Ls(0,x) = We S~ x(c)ordgy (ceo) . 
aéG 


Use Exercise 8 and Proposition 15.10 to deduce Wx¢s(0,0) = 
ord,-193, (€0). 


Assuming #(S) > 2 show that Conjecture A implies WrKés2o = 
(€,), which verifies Brumer-Stark for the prime divisor Bo. Show 
Conjecture A also implies Brumer-Stark at 8, in the remaining case 


where #(S) = 1. 


The result of Exercise 10 can be used to prove the full Brumer-Stark 
conjecture if one assumes Conjecture A. We sketch a proof and invite 
the reader to fill in the details. We have seen that Brumer-Stark 
is true for a prime % ¢ S provided that $ has relative degree 1 
(f($8/P) = 1). Choose one such prime , and let S, be the set 
consisting of %8, alone. The S,-class group (of K) is finite and every 
class in it is represented by infinitely many primes of relative degree 
one. This follows from considerations of L-functions associated to 
Clg. Using this, show that if D is any divisor of K we can write 
D = $% + (a) + m,, where % ¢ S is a prime of relative degree 
one, a € K*, and m is an integer. The result now follows from the 
Exercise 11 and the fact, proved in Tate [3], that Brumer-Stark is 
true for principal divisors. 


In the next set of exercises we sketch the proof of the Brumer-Stark 
conjecture in the case of relatively quadratic extensions of global function 
fields. Let K/k be a geometric extension of degree 2. Assume that the 
characteristic of k is not 2. Let the Galois group G of K/k be generated by 
T. Let x be the unique non-trivial character of G. Let S be a finite set of 
primes of k which include all those primes which ramify in K. Finally, let 
S’ be the set of primes of K lying above those in S. We will assume |S| > 2 
and that no prime in S splits in K (otherwise the Stickelberger element 05 
would be zero). 


13. 
14. 


15. 


16. 


17. 


18. 


19. 


20. 
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Prove that 65 = 2—!Lg(0, x)(1 — 7). 


Use the relation ¢x,5/(w) = Cx,s(w)Ls(w,x) to show (with obvious 
notation) 
hrs Rx,s' 
Ls(0,x) = —>——_ . 
s(0;x) hrs Rr,s 


Let s = |S|, Ux,s: be the S’-units of K, and Ux,5 be the S-units of 
k. Show 
2°" Rig = (Ux, : Ux.s|Rx,s° - 


Hint: Since no prime in S is ramified or split in K show that for every 
u €k*, log |ulg = 2log|ul|p for P\P € S. 


Show that the kernel of the natural map from Clk,5 — Clx,s’ is 
isomorphic to H'(G,Ux,s:) = {u € Ux | wu™ = 1}/{u/u™ | we 
Ux,.s'}. The reader who does not know cohomology of groups may 
just want to accept this fact “on authority.” 


For all u € Ux, s: show that u7 = tu. Hint: First show u7/u € F* by 
showing |u7 /u|s3; = 1 for all primes % of K. 


The map u —> wu’ /u gives rise to an exact sequence (1) — Uk. - 
Ux,s: — (+1). Use this and the definition of H1(G,Ux,5:) to prove 
that [U5 : Ux,s||H'(G,Ux,s)| = 2. Hint: Consider individually 
the following two cases: the case where u = u” for all u € Ux,s) and 
the case where there is a up € Ux.s such that uz = —uo. 


Let M be the number of elements in the cokernel of the natural map 
from Clr 5s — Clx,s:. Use the last few exercises to give the following 
explicit description of the Brumer element: 


—] 
ws =(q-1)6s = i 2 M(1 —r). 


Use the result of Exercise 19 to verify the Brumer-Stark conjecture for 
the extension K’/k and the set S. For all this in the case of algebraic 
number fields consult Tate [3]. 
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The Class Number Formulas 
in Quadratic and Cyclotomic 
Function Fields 


In this chapter we will discuss the analogues of some fascinating class num- 
ber formulas which are well known in the case of quadratic and cyclotomic 
number fields. Some of these go back to the nineteenth century, e.g., the 
work of Dirichlet and Kummer. More recent contributions are associated 
with the names of Carlitz, Iwasawa, and Sinnott. We will review some of 
these results and then formulate and prove a number of analogues in the 
function field context. 


Let’s begin by reviewing the class number formulas for quadratic num- 
ber fields (for details, see the classical text of E. Hecke [2]). We need the 
definition of the Kronecker symbol which is a mild generalization of the 
Jacobi symbol of elementary number theory. Suppose d is an integer con- 
gruent to either 0 or 1 modulo 4. If p is an odd prime, define (d/p) to be 
the usual Jacobi symbol. If p = 2, define (d/2) = (d/ — 2) to be 0 if d is 
even, 1 if d= 1 (mod 8), and —1 if d=5 (mod 8). Now define (d/m) for 
any non-zero integer m by multiplicativity. This new symbol is called the 
Kronecker symbol. It is useful in the theory of quadratic number fields, as 
we will see in a moment. 


Let d € Z be square-free and consider the field K = Q(Vd). The dis- 
criminant of K/Q, 6x, is dif d=1 (mod 4) and 4d if d = 2 or 3 (mod 4). 
If xa is the non-trivial character of Gal(K’/Q), then it can be shown that 
the Artin L-function L(w, ya) is given by 
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By using the relation ¢x(w) = ¢g(w)L(w, xa) and comparing the residue 
of both sides at w = 1 we link up the class number of K with the value 
of L(w, xa) at w = 1. Pursuing these ideas leads to the following theorem 
which is due to Dirichlet. 


Theorem 16.1. The class number h of the quadratic number field K = 
Q(Vd) is given by 


|5x|-1 
(a) h= 5 S° m(6x/m) , 
if dn < —4, and by 
(b)  — 1 I], sin(7a/6x ) 


Qlog(e) ~ [],sin(mb/5x) ” 


if dx > 1. Here, € >1 is the fundamental unit of K, and a varies over all 
integers between 1 and bx with (6x /a) = —1, and b varies over all integers 
between 1 and dx with (6x /b) = 1. 


In the case that d is negative, K is called an imaginary quadratic number 
field. Part a of the theorem shows that the class number of such a field can 
always be computed in finitely many steps. It turns out that this is not 
the most efficient way to compute the class number, but the formula is 
remarkable nevertheless. 

In the case where d is positive, K is said to be a real quadratic number 
field. The Dirichlet unit theorem tells us, in this case, that the unit group 
modulo (+1) is infinite cyclic. There is precisely one unit € in K which 
is greater than 1 and projects on to a generator. This unit is called the 
fundamental unit in L. In part b of the theorem, let 7 be the quotient of 
the product of values of the sine-function which appears on the right hand 
side of the equation. We have 


1 

= Plog(e) log(7) , 

from which it follows that «?” = 7. This shows that 7 is a unit of K which 
can be explicitly constructed using special values of the sine-function. It is 
called a cyclotomic unit. It turns out to be a general phenomenon that for 
totally real abelian number fields, the class number is related to the index 
in the whole unit group of an explicitly constructed subgroup of cyclotomic 
units. 

We next consider the cyclotomic fields K,, = Q(¢m) and their maximal 
real subfields K* = Q(Gm +71). Let hm denote the class number of Kyn, 
and h denote the class number of K;. It can be shown that h*|hm so 
that h,, = h« h-, where h- is an integer called the relative class number. 


me “mm 


We will state results about both h*, and hz. 
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For a prime to m let og € Gal(Km/Q) be the automorphism which takes 
Cm to ¢%,. This induces an isomorphism (Z/mZ)* = Gal( Km/Q). Note that 
o_; is complex conjugation. Any character of Gal(K,,/Q) can be thought 
of, via this isomorphism, as a character on (Z/mZ)*. We call x an even 
character if y(—1) = 1 and an odd character if y(—1) = —1. Since —1 
corresponds to complex conjugation, we see that the even characters are in 
one-to-one correspondence with the characters of Gal(K7,/Q). 

For the sake of simplicity, we restrict ourselves, in the statement of the 
next two theorems, to the case where m = p, an odd prime. 


Theorem 16.2. Let h> be the relative class number of Q(¢p). Then, 


hy =2p || (-75 xo) , 


x odd 


where the product is over all odd characters of (Z/pZ)*. 


This beautiful result is due to Kummer. It shows that the relative class 
number can be computed in finitely many steps. It also turns out to be use- 
ful in deriving divisibility results about the class number. As we shall soon 
see, it is possible to rework this formula in such a way that the calculation 
of h> involves nothing but elementary arithmetic in Z. 


Recall that in Chapter 12 we showed that the elements a are units 
in the field K,. Assuming that kK, C C we can choose ¢, = ep. Then 


Gp — 1 _ pt (a-1) sin(ma/p) 
Cp — 1 sin(7/p) 


The element e? ‘*~)) is a 2p-th root of unity and so is a unit in k,. Thus, 
the elements 
sin(7a/p) 
sin(7/p) 


are units in K, and, in fact, they are units in Ky . Note that the units 
corresponding to a and p—a are the same. Kummer showed that the units 
corresponding to a in the interval 1 <a < & are independent. He actually 
showed much more. 


fora = 2,3,...,p—1 


Theorem 16.3. Let Cr be the subgroup of units in Ky generated by the 
units . 
sin(wa/p) for l<a< P ; 
sin(7/p) 2 
and by +1. Let Et be the full unit group of Kt. Then, hg = [Ez : Cy]. 
This result can easily be generalized to apply to Kpt, i.e., to Km where 
m is a prime power. For a proof see Lang [6] or Washington [1]. It can 
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also be generalized to arbitrary positive integers m, but this is much more 
difficult than one might expect. It was accomplished by W. Sinnott in 1978 
(see Sinnott [1]), over a hundred years after Kummer proved his result. 

We will prove the analogue of Theorem 16.3 in the case of cyclotomic 
function fields K,, where m = P is a monic irreducible polynomial in F(T]. 
The result for prime powers was first proven in Galovich-Rosen [1] and 
generalized to arbitrary monic polynomials m in Galovich-Rosen [2]. In 
fact, much more general versions hold in the function field case. We will 
discuss these generalizations briefly after the proof of Theorem 16.12. 

A tool which is useful both in number fields and function fields is the 
Dedekind determinant formula. This result was communicated by Dedekind 
in a letter to Frobenius in 1896. 


Theorem 16.4. Let G be a finite abelian group, f a function from G to 
C, and G = Hom(G,C*). Then 


(a) det [f =|] S- x(¢ 


xEG TEG 
and 
O det [fo '7) — fo) = T] Sx). 
o,THe 
X#Xo TEG 


Proof. Let V be the vector space of all complex-valued functions on G. 
A basis for this vector space is given by the functions 6,(x) defined by 
6g(o) = 1 and 6,(r) = 0 for tr 4 o. The proof is straightforward. This 
shows the dimension of V over C is n := |G|. 

Another basis of V is given by {x | x € G}. The characters are linearly 
independent over C, as can easily be seen from the orthogonality relation. 
Since |G] = |G| by the corollary to Proposition 15.1, it follows that the 
elements of G are a basis for V, as asserted. 

Let G act on V by defining of to be the function which takes x to f(a). 
This extends to an action of the group ring C[G] on V as follows: 


(S- Agd )f= S- Ag ( (of). 
aEG aeEG 


Now, fix an element f € V and associate to it the group ring element 
T =>, f(a)o. The idea is to look at the matrix of T with respect to the 
two bases of V we have given and then take determinants. 

First, note that 06, = 6,,-1. Thus, 


T6, = S- f(o)brg-1 = Y> f(a7*r)be . 


It follows that the determinant of T is the determinant of the n x n matrix 


[f(o~*r)]. 
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Secondly, note that (ox)(x) = x(xo) = x(x)x(o) = x(7)x(x), so that 
x is an eigenvector for o with eigenvalue y(c). It follows that x is an 
eigenvector for T with eigenvalue 5°, x(a) f(a). The determinant of 7 
on V is just the product of its eigenvalues and this concludes the proof of 
part a. 

To prove part b we restrict T to the subspace Vo := {f €V | Sogea f(a) = 
0}. It is easy to see that this subspace is mapped into itself by every element 
in G and thus by every element in C[G]. We write down two bases for Vo. 
The first is {5, — 6. | 7 #4 e}. The second is {x € G| x # x0}. The proof 
that the first is a basis is a simple exercise. As for the second, we know that 
for x # Xo we have )> x(a) = 0. This shows the non-trivial characters are 
in V,. There are n —1 of them and they are linearly independent. Since the 
dimension of V, is n — 1, we conclude that the non-trivial characters form 
a basis. 

What is the matrix of T restricted to V, with respect to the basis {6, — 
de | T #e}? From our earlier computation we see 


T (5, —5e) = Tb, —T5e = Y°(f(o7'7) = f(o7") bo 


ceEG 


For any f € V we have 0 = oo eg(f(o7'7) - f(o~+)). Multiply both sides 
of this equation on the right by 6, and subtract the result from the last 
sum in displayed equation. We find 


T(6r~6e)= S> (f(om*t) — flo™"))(5a — 6e) 


aEG,aoFe 


This shows the determinant of T restricted to V, is the determinant of the 
(n—1) x (n—1) matrix [f(o~!r) — f(o~!)]. The proof of part b now follows 
from considering the action of T on the basis {y € G | v Xo}, exactly as 
in the proof of part a. 


We remark that if o is replaced by o~? in either determinant considered 


in the theorem, the effect is simply to permute the rows so that the deter- 
minant is multiplied by +1. Thus, det[f(o~!r)] = +det[f(or)]. We will 
use this remark shortly. It is a nice exercise to determine this sign change 
in terms of the structure of the group G. 

As an illustration of the use of the Dedekind determinant formula we 
will state and prove the promised reformulation of Theorem 16.1. 

First we recall a definition from elementary number theory. Let r € R 
be any real number. Then there is a unique integer n € Z such that 0 < 
r—n <1. We set n = [r] and r—n= (r). The latter quantity is called the 
fractional part of r. Note that ifa,m € Z and m # 0, then (<) depends 
only on the residue class of a modulo m. 

It will be convenient to write G,, = (Z/mZ)* and GY, = Gp,/(+1). 
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Let x be an odd character of (Z/pZ)* and define the generalized Bernoulli 


number 
p—l a 
Bix = > x(a)(— 
a=1 


The reader will recognize these are the quantities that appear in the state- 
ment of Theorem 16.2. Since both x(a) and (%) only depend on a modulo 
p we can rewrite this definition as 


Bix = 3 x(a 


a€Gp 


In this expression substitute —a for a and use the fact that x is odd to 
derive 


Bix = S> —x(a)(—) . 


a€Gpy P 
Now add both expressions for B;,, and we find 


Bix = 5 Dy x(a) (2)-()) | 


acG»y 


The summands are invariant under the substitution a ~ —a, so we get our 
final expression for B,,, 


Bix = YX x(a) ((2)-(=9)) . 


aeGt 


Theorem 16.5. Let hy be the relative class number of Q(¢p), where p is 
an odd prime number. Then 


_ 2p | ab —ab | 
h, = +—— det |(—) — (——)] , 
Po 9P et \ p -_ p 

where a and b are integers in the range 1<a,b< be. 


Proof. From Theorem 16.2 and the expression we have derived for the 
generalized Bernoulli number B,,,, we find 


hz = + er ve I Lx )((S) -(=*)) | 


From this, it is clear that all we have to prove is that the product in this 
expression is, up to sign, the determinant of the theorem. 
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Let w be any odd character on G,. The set of odd characters is easily seen 
to be the same as {wy’ |x’ even}. Thus, the product in the last expression 
is the same as 


IT © x'@ura) (($) - (=)) = £ee (ab (2) -(—))] | 


x’ even aeGt 


where we have invoked the Dedekind determinant formula as it applies to 


the group GF and the function f(a) = w(a) ((2) — (=2)), 


To complete the proof, simply notice that {a€ Z| 1<a< Po} is a set 
of representatives for Gt. Also, since w is a character, w(ab) = w(a)w(b). 
Thus, the terms involving w in the determinant can be factored out to give 


p=1 
the determinant of the theorem multiplied by [],2, (a)?. This is easily 
seen to be (-1) ">, and that completes the proof. 


This elegant result is was proved in Carlitz and Olson [1]. A discussion 
can also be found in Lang [6], Chapter 3. Later, we will give a function 
field analogue of Theorem 16.5. For now, we will concentrate on finding an 
analogue to Kummer’s theorem, ‘Theorem 16.3. 


Let’s begin by recalling some notation and results about cyclotomic func- 
tion fields. Let A = F[T], k = F(T), Am = the m-torsion points on the 
Carlitz module (m € A, a monic polynomial), Ky, = k(Am), and On, the 
integral closure of Ain Ky, . 

We have an isomorphism a > o, from (A/mA)* > Gy, = Gal(Km/k) 
where o, is characterized by o4(A) = Ca(A) for all A € Am. 

Let J = {oq | a € F*}. The fixed field of J is denoted by Kj, and by 
analogy with Q(¢,,)* is called the maximal real subfield of K,,. We denote 
by O* the integral closure of A in K;,. The prime oo of k splits completely 
in K*. Each prime 8%. of Km which lies above oo is totally and tamely 
ramified above K;«. The map a — o, gives rise to an isomorphism: 


(A/mA)*/J & Gt = Gal(Ky/k) . 


Let S,, be the set of primes of K,, lying over S = {oo} and S>, the set 
of primes of K> lying over oo. We have |S7*| = |S,,| = ®(m)/(q — 1) and, 
by Proposition 14.8, 


hk, = ho, RQ and hyt =hogRY . 


In these equations, RY is the g-regulator of the S,,-units and Re is the 
q-regulator of the S*-units. 

Our next goal is to give analytic formulas for the class numbers of O,, 
and O*. One approach would be to specialize Theorem 14.13 to the two 
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abelian extensions K,,/k and K;,/k. However, that theorem involves some 
advanced material in its statement, for example the notion of the Artin 
conductor whose very definition depends on the Artin reciprocity law. We 
prefer to derive the necessary formula in the present circumstance in a 
much more elementary fashion. 

Let’s reconsider the Artin L-series L.4(w, x), which appeared in Chapter 


14, 
La(w,x) = S> x) 
D 


In this sum, D varies over all effective divisors of k prime to S = {oo}. 
Recall x(D) = x((D, Km/k)) if D is prime to the conductor of x and 
x(D) = 0 otherwise (see the discussion of these points in Chapter 9). The 
effective divisors prime to oo are in one-to-one correspondence with ideals 
in A. We make this identification. The definition of the conductor of x can 
be made very concrete in the present situation. Let Gm(x) G Gm be the 
kernel of y and let K(x) be the corresponding subfield of K,,. Then the 
conductor of y, or rather the part of the conductor which is prime to oo, 
is given by the ideal (m,) C A where m, is the monic divisor of m of least 
degree such that K,,(x) C Km, CG Km. Since Gal(Km/Km,) © Gm(x) we 
see that x can be viewed as a character on Gm, = (A/m,A)*. 

Each ideal D in A has a unique monic generator, say, a. It follows 
immediately from Proposition 12.10 that if D is prime to (m,), then 
(D, Km, /k) = a as elements of G,,,. Thus, if we define, as we have been 
doing, x(0.) = x(a), we can rewrite the L-series as follows: 


x(a) 
jalv 


La(w, x) — 
a€A 
a monic 


We have used ND = N(aA) = #(A/aA) = q%°%* = |a|. For emphasis, 
in this equation x is being considered as a character on (A/m,A)*. From 
now on we make this convention: whenever a Dirichlet character modulo 
m occurs in an L-series, L4(w,x), we regard x as a character modulo m,. 
With this convention, the Artin [-series and the Dirichlet L-series coincide. 

Our Artin L-series (associated to K,,/k with S = {oo}) have been re- 
vealed to be nothing more than the Dirichlet L-series that we treated in 
some detail in Chapter 4. Proposition 4.3 shows that when y # yo, L(w, x) 
is a polynomial in q~™ of degree at most M, —1, where M, = deg m,. Set- 
ting u = q-™” and La(w,x) = La(u, x) we recast the content of Proposition 


4.3 as follows: - 
Latux)= Sd) x(a)useee. (1) 


a@ monic 
dega<M, 


At this point we need to make a distinction between characters. For any 
monic polynomial m, we call a character of (A/mA)* even if x(a) = 1 for 


16. The Class Number Formulas 291 


all a € F*. Otherwise, x is said to be an odd character. If we think of x 
as a Galois character on G,,, then it is even if and only if it is trivial on 
J = Gal(K,,/K;,). For this reason, one sometimes calls an even character 
a real character and an odd character an imaginary character. This is all 
done by analogy with the number field case. 

Recall that the value w = 0 corresponds to the value u = 1. We will need 
the following result. 


Lemma 16.6. If x 4 x is an even character, we have La(1,x) = 0. 
Proof. If x is even, a € F*, and a € A is monic, then x(aa) = x(a). Thus, 


La,x)= YS) xla)=(q-1)* So x(a) =0. 


a monic aX#0 
dega<M, deg a<M, 


The last equality is a consequence of the following facts: x is not trivial, 
x(a) = 0 if (a,m,) #1, and the set {a € A | (a,m,) =1, dega < M,} is 
a set of representatives for the group (A/m,A)*. 


Proposition 16.7. We have 


(a) 6o,,(w) =¢Ca(w) TT La(w,x) . 
X#FXo 
(0) Cox (w) = I] La(w,x) 
X#Xo 


The first product is over all non-trivial Dirichlet characters modulo m and 
the second is over all non-trivial even Dirichlet characters modulo m. 


Proof. Both formulas are special cases of Proposition 14.11. To justify the 
second formula, note that even characters are the characters of (A/m)*/F* & 
Gm/J = GY, = Gal( Kz /k). 

It is possible to give a proof which avoids Artin L-series and just uses 
properties of Dirichlet L-series. One combines Lemma 4.4 with Proposition 
12.10 and the definition of €o,,(w) to get the first equality. The second 
equality can also be done in a similar manner. This method is especially 
easy to carry out when m is irreducible. In the general case, there are 
technical difficulties introduced by having to consider conductors. 


Theorem 16.8. We have 


(a) hx, = II ( S> x(a)) II ( ‘> —dega x(a)) . 


x odd a monic xX even a monic 
dega<M, X#Xo dega<My, 


het = lI ( S> —dega x(a)) . 


xX even @a@ monic 
X#Xo dega<M, 


oo~ 
on 
— 
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Proof. Recall that 
Co,,(w) = J] @-NP~”) éx,, (w) 


BESm 
and 
¢ _ Lim (q_”) 
Km Lge )(l = gi) | 

where Lx, (u) is a polynomial whose value at u = 1 is hx,,. 

Since every prime in S,, has degree 1, we have 1 -N~” = 1—q™” for 
all $8 € S,,. So, combining the last two equations and switching to the “w” 
language, we find 


a(m)_, LK, (u) 
Gon (w) = (Lu) Pt HK 
By Proposition 16.7, part a, we find 


1 


60, (w) = 1 — qu 


]] Lax - 


XFXo 


Now, combining these last two equations and rewriting slightly, we find 


Tr L U, X 
Licn(u) = Tf La(uyx) TT “Ae, (2) 
x odd x even 
X#Xo 


We have used the fact that the number of non-trivial even characters 
is ae — 1. This is because the set of even characters are in one-to-one 


correspondence with the characters of (A/mA)*/F*, a group with ae 
elements. 


We would like to just substitute u = 1 into Equation 2, but we must first 
deal with the expressions [4 (wu, x)/(1 — u) when x is even and non-trivial. 
By Lemma 16.6, the numerator of these expressions are zero at u = 1. We 
can apply L‘H6pital’s rule and Equation 1, which gives an explicit formula 
for the polynomial L4(u, x) to derive 


lim Lalu, x) = — S- dega x(a) , 


url JLl—u 
a monic 
dega<M, 
whenever x is even and non-trivial. 
The proof of part a of the theorem now follows immediately by taking 
the limit as u > 1 in Equation 2, and using Equation 1 once again. 
The proof of part (b) follows along exactly the same lines using the 
fact that the even characters are in one to one correspondence with the 


characters of (A/mA)*/F* & G+, and the fact that |S*| = |S,,| = oir). 


Before we can state and prove the main theorem of this chapter, we need 
three more preliminary results. 
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Lemma 16.9. Let m be a monic polynomial and suppose that » is a gen- 
erator of Am. If b is a polynomial prime to m, then opA/r € Kz. 


Proof. By definition of the automorphism op we have opA = Cp(A). In 
particular, when a € F*, we have ogA = Cg(A) = aA. Thus, 


Ta(Ovr/A) = Op0aA/Tar = O4(AA)/AA = anA/A . 


It follows that op,A/A is fixed for all elements in {o, | a € F*} = J, the 
Galois group of K,,/K;.. The result follows. 


It follows readily from the results of Chapter 12 that the elements a,A/A 
are units. If m is not a prime power, then 4 itself is a unit. If m = P® is 
a prime power, then A generates the unique prime ideal in O,, lying above 
P. It follows easily that o)A is another such generator and therefore opA/A 
is a unit. Units of this type will play a key role in what follows. 

Now that we know o)\/. is a unit in O* our next task is to determine 
its divisor as an element of K;,. In Chapter 15, using results developed in 
Chapter 12, we showed that there is a primitive m-torsion point Am» € A 
and a prime 8. of K,, lying over oo such that for all monics a € A with 
(a,m) = 1 and dega < M we have 


ord,-1g, (Am) = (¢-1)(M —dega-—1)-1. (3) 


See Proposition 15.17 and its proof. 

We need to allow a to vary somewhat more freely. For a € A with 
(a,m) = 1, define (a) to be the unique polynomial c with 0 < degc < M 
and a =c (mod m). Define fm(a) = (q¢ —1)(M — deg(a) — 1) — 1. We can 


then rewrite Equation 3 as follows: 


ord = 193. (Am) = fm(a) . (3') 


The advantage is that Equation 3’ is valid for any a prime to m. 

The ®(m)/(q—1) primes in S,, = {07 1 | a monic, dega < M,(a,m) = 
1} are all the primes in K,, lying over oo. Let Bt, be the prime of K;7, 
lying below B... Then S* = {o7'P5 | a monic, dega < M,(a,m) = 1} 
are all the primes of K;* lying above oo. Since apAm/Am is a unit in OF, 
the next proposition completely determines its divisor. 


Proposition 16.10. Fora,b€ A monic and prime to m we have 


fin (ab) — fm(a@) . 


d_- Am m) = 
or oximt, (7% / ) q—1 
Proof. Using Equation 3’ above, we find 
ord ,-19, (o,Am) = ord 14-195, (Am) 


(q — 1)(M — deg(ab) — 1) —1= fm(ab) . 


| 
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Combining Equations 3’ and 4, we find 
ord -1g,) (FAm/Am) = Ord, -19, (GHA) Ord, 193 (Am) = fm(ab)—fm(@) - 


Finally, since o7/$8 is totally and tamely ramified over o7 +t, and 
O4Am/Am € Kz, by Lemma 16.9 we have 


fab) = fn(a) 


ord, -1m + (0,Am/Am) = q— 1 


1 
_ ord stg, (%Am/Am) = 

The units o,A/, are similar to the cyclotomic units om in the number 
field Q(¢,,). We now give the general definition of cyclotomic units in the 
function field case. 


Definition. Let V,, be the subgroup of K%, generated by the non-zero 
elements of A, and €,, = V,, 7 O*,. The group €,, is called the group of 
cyclotomic units in Kj. 

Note that constants are cyclotomic units since if a € F* we have a = 


CaAm/Am: 
Lemma 16.11. If m= P* is a power of an irreducible P, then the group 
of cyclotomic units, Em, is generated by F* and the set 


Tm = {0Am/Am | 6 monic, 0 < degb < degm, (b,m) = 1}. 


Proof. We will give the proof when m = P is irreducible and leave the 
case m = P',t > 1, as an exercise. 

Every non-zero element of Ap has the form o,p, where a varies over the 
non-zero polynomials of degree less that deg P. If u is a cyclotomic unit, 


then 
U= ] [(caAr)"* ; 


a 


where the exponents n, are in Z. Rewrite this equation as 


u=|[(oadrp/Ap)™ x Apr . 


Qa 


Consider the fractional Op-ideal generated by both sides. We find, Op = 
(Ap)”". Since (Ap) is a prime ideal, this implies S* ng = 0. It remains to 
show that we can restrict our attention to a@ monic. 

If a is not monic, a = ab with a € F* and b monic. Then 


OaAp/AP = Opn0gAp/Ap =a ovrAp/Ap : 


The lemma now follows immediately. 


Corollary. [fm = P*, every cyclotomic unit is in O7,". 


16. The Class Number Formulas 295 


Proof. It follows from Lemma 16.9 that the set T,, is contained in K;.. 
Since the constants, F*, are also in K;*, the result follows. 


We remark that both the lemma and the corollary are false if m is not a 
prime power, since then A, is a cyclotomic unit and it is not in K7. 

Note that the set T,, has the same cardinality as the rank of O*,. One 
might be tempted to think that it generates a subgroup of finite index. 
This is not always the case, but it is true if m = P° is a prime power. We 
will now show this and more when m = P is itself a prime. The following 
theorem is the analogue of Kummer’s theorem, Theorem 16.3. 


Theorem 16.12. Let m = P be a prime. Then, the group of cyclotomic 
units, Ep, 1s of finite index in Os" and 


* 


hor = [OF »Ep| . 
Proof. We begin by specializing and reworking the analytic class number 
formula, Theorem 16.8, part b, in the case where m = P, a prime in 
A=FY{T). 

Since P has no monic divisors except 1 and itself, we see that any non- 
trivial character y on (A/PA)* has P for its conductor. Let d = deg P. 


Then 
Ant = I] ( S- —dega x(a)) . 


xX even aq monic 
XFXo deg a<d 


Recall the definition, fp(a) = (q — 1)(d — deg(a) — 1) — 1. By Lemma 
16.6, we see that for x even and non-trivial, }°, monic, deg aca X(2) = 0. It 
follows that 


S> x(a)fp(a)=(q-1) S > —dega x(a) . 


a monic a monic 
dega<d dega<d 


Thus, our formula for h,+ can be rewritten 
m 
&( 


hye =(-™ TT (  x@) fea). 


x €VEN @ monic 
XFXo dega<d 


We want to apply the Dedekind determinant formula to rewrite the 
right-hand side of this equation. The group we are considering is Gt = 
Gal(K#/ k). Ifo € GE, then o coincides with o, for some a representing 
an element of (A/PA)*/F*. Define fp(c) = fp(a). It is easy to see that 
this definition is independent of the choice of a. Now, invoking Theorem 
16.4, part (b), we find 
(5) 


hict = +(q- 1)! et det[fp(ab) — fp(a)] = + det| PI FP) , 
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In this matrix, a and b vary over all monic polynomials which are distinct 
from 1 and have degree less than d = deg P. 

By Proposition 16.10, the determinant on the right-hand side is the same 
as 


det lord, ig (orp /AP) ; 


Call the absolute value of this determinant RY, Using this remark, Equa- 


tion 5, and the fact that hick = hos RY (see Proposition 14.8), we derive 


st? 
RY 
hos ~ a (6) 
Re 
P 


It remains to show that the right-hand side of this equation is equal to 
the index of €p in Ox. To do this we use the ideas that go into the proof 
of Proposition 14.3. Matters are even simpler in the present case, since all 
the primes above oo have degree 1. 

Before proceeding, let’s simplify the notation. Let U be a subgroup of 
OF" and S = Si. Let s = |S| = a For each monic a with dega < d, 
let B. =o7;'/PBt. Arrange the monics of degree less than d in some order 
with 1 being the first and label the coordinates of Z* with these monics. 

Define a map, /,, from K;, to Z°, which takes an element z to the s-tuple 
whose a-th coordinate is ordy, (x). Let H° be the subgroup of elements in 
Z* whose coordinates sum to zero. H° is a free group of rank s — 1. Using 
the fact that the primes in S have degree 1, it follows that if u € U, we 
have |,(u) € H°. 

Let T = {uj,ug,...,Us—1} be a set of elements in U. Consider the (s — 
1) x (s —1) matrix: 

T = fords, (ui)] 


where 1 <i < s—1 and a #11 varies over monics of degree less than d. 
We claim that the s-tuples {l,(w1),lg(u2),...,4q(us—1)} are linearly inde- 
pendent over Z if and only if det 7 #4 0. Moreover, if det 7 4 0, then the 
group generated by {lg(u1),lg(u2),...,lg(us—1)} has index |det 7| in H°. 
To see this, let e; € Z*® be the vector whose first coordinate is 1 and all of 
whose other coordinates are 0. Then, Z° = Ze; @ H° and we are reduced 
to considering the s x s matrix whose first row is e; and whose 1-th row, 
for 2 <1 < 8, is lj(u;_1). The determinant of this matrix is the same as 
the determinant of 7 (consider the cofactor expansion along the first row), 
which proves the assertion. 

Applying these general considerations to a set of fundamental units and 


to the generating set of cyclotomic units, we find that RY? = |[H° :1,(O3°)| 
P 
and RY = [H° :1,(€p)]. From this and Equation 6 we find 


hot = (lg(Of ) :Ig(Ep)} . 
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If U is any subgroup of the group of units containing F* we have an exact 
sequence (1) > F* — U + 1,(U) — (1). It follows that 
hot =[Of$ :Ep]. 


* 


Remarks. 


1. The main reason the case m = P, a prime, is so much easier than the 
general case is that all non-trivial Dirichlet characters have P as conductor. 
This makes it easy to use the Dedekind determinant formula in conjunc- 
tion with the analytic class number formula. In other cases, the conductor 
depends on the character and one must take care. In the case m = P*, a 
prime power, this is relatively easy to do (see the exercises). If m is not a 
prime power, keeping track of the conductor is quite difficult and requires 
rather sophisticated technique. 


2. Theorem 16.12 first appeared in Galovich-Rosen [1]. Soon thereafter it 
was generalized to K;* for arbitrary m (see Galovich-Rosen (2]) by following 
the methods of Sinnott [1]. These results were generalized in stages to the 
case of arbitrary global function field k as base and with ray class fields 
taking the place of K,,. See the work of Hayes [4], Shu [1], and Oukaba [1]. 
The most general case was handled by L. Yin [1]. 


We want to use Theorem 16.12 to help provide an analogue to Dirichlet’s 
theorem, Theorem 16.1, part b. For simplicity we will assume m = P, a 
monic polynomial of even degree d. 

It will be useful to define M to be the set of monic polynomials of degree 
less than d. 


Lemma 16.13. Assume q = |F| ts odd and that P is a monic irreducible 
of even degree d. Then k(VP) C Kj. 


Proof. Recall the factorization of the Carlitz polynomial 


P| Clu) _ II (u — X) 


AEAp 
A40 


Comparing constant terms on both sides shows (—1)%-} [JA =[]A = P, 
where the product is over all elements in Ap — {0}. 
The set of non-zero elements of Ap coincides with {o,Ap | a 4 0,dega < 


d}. Since every non-zero polynomial can be written uniquely as the product 
of a constant times a monic, we derive the following equation: 


(T] = TI are) =P. 


ack* aeEM 


The product of all the non-zero elements in a finite field is —1. Since q is 
d 
assumed odd and d is even, | is even. It follows that P is a q — 1-power 
in Kp, and so, a posteriori, a square. This shows that k(./P) C Kp. 
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To show k(VP) C Kj, note that Gp is cyclic. This shows there is a 

d 
unique quadratic extension of k inside Kp. We have just seen that 1 = 
|G| is even. It follows that k has a quadratic extension inside Kj. By the 


uniqueness, it must be k(VP). 


We now want to define a cyclotomic unit in k(P). Recall that o,A/A € 
KF for all a not divisible by P. The Galois group GE is isomorphic to 
(A/PA)*/F*. For the rest of the present discussion we are going to let a 
vary over (A/P)*/F*. Define 


( OaAP yo (7) 


7= sen(a)Ap 


a€(A/PA)*/F* 


where (a/P) is the quadratic character on (A/PA)*. The factor sgn(a), 
the leading coefficient of a, is included so that the quotient o,/sgn(a)A is 
independent of the class of a in (A/PA)*/F*. We also need to know that 
(a/P) is an even character, i.e., that it is equal to 1 on F*. This is true, 
since for a € F*, 


|P|—1 q?—1 q—1 


(a/P)=a 2 =qiri 2 =], 


d 
because, under our assumptions, = ; is an even integer. 


Lemma 16.14. The unit 7 is an element of k(VP). 


Proof. We have seen that k(VP) C Kj. An element o, € Gf is in 
Gal(K/k(/P)) if and only if b is a square in (A/PA)*/F*. Notice that 


Oba AP —(a/P) OpAp da (a/P) 
on=( JT] (—cmar- _oehP_ 
( sgn(ab)\ p Gaon 


The second factor is equal to 1, since }>,(a/P) = 0. If b is a square in 
(A/PA)*/F*, then (b/P) = 1 so (a/P) = (ba/P). Under these conditions 


the above equation shows oy7 = yn, which proves the lemma. 


Theorem 16.15. Let q be odd and P be a monic irreducible polynomial 
of even degree d. Let K = k(WP) and Ox the ring of integers of K (1.e., 
the integral closure of A in K ). Let poo be a prime of K lying over oo and 
€ a fundamental unit of OX such that ordy.(€) < 0. Finally, let n be the 
cyclotomic unit defined in Equation 7 (after an appropriate choice of Ap). 
Then 

ord, .. (7) 


MOK = ord (6) 


Moreover, €"°x = an for some a € F*. 
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Proof. By Propositions 14.6 and 14.7, we know that oo splits in K and 
that 

hk = ho, log, |€lp., = —ho, Ordy,, (€) - 
The last equality uses the fact that p.. has degree 1. 

The Artin character x of Gp = (A/PA)* corresponding to K/k is even 
and of order 2. The only prime of K which is ramified over k is the prime 
above P, from which it is easy to see that y(oa) = x(a) = (P/a) for all a 
not divisible by P. By the law of quadratic reciprocity (see Theorem 3.5), 
we find 


(P/a) = (—1)4*8 24°F “T° geno (a) #8? (a/P) = (a/P) , 


since deg P is even by hypothesis. 
Using the same method of proof that led to Theorems 16.8, we can derive 
the following class number formula for hx: 


hk = S- —dega x(a) = S> —dega (a/P) . 


a monic a monic 
deg a<d deg a<d 


Recall the definition fp(a) = (¢—1)(d—deg(a) —1)—1. By Lemma 16.6, 
we rewrite this formula as 


hk =(q-1)* )> (a/P) fr(a). (8) 

degacd 
Let $B8.. be a prime of Kp lying over py and let Ap be a generator of 
Ap such that ordy,Ap = (q — 1)(d — 1) — 1. As we have seen, such a 
generator exists and for all a not divisible by P we have ordy,,(o.A) = 
fp(a). Substituting this into Equation 8 and simplifying the result gives us 


he = (q~-1)"'ordp..(n™") . 


By Lemma 16.14, 7 € K. Also, 8. is ramified over p,, with ramification 

index q — 1. Thus, 
he = —ordy,.(7) . 

The first assertion of the theorem follows by combining this equation 
with hx = —ho, ord,,. (€), which has already been demonstrated. 

To prove the last assertion, consider a := ex 771. This is a unit in Ox 
and the first part of the proof shows that the ord of a at poo is zero. Since 
the divisor of a has degree zero its ord at the other infinite prime must 
also be zero. We have shown the divisor of a is the zero divisor. Thus, a is 
a constant. 


It may be useful to rewrite the unit 7 in a way that emphasizes the 
relationship of Theorem 16.15 to Theorem 16.1, part (b). Namely, 


) = II OarAP / I] OpAp . 


(a/P)=—1 (b/P)=1 


a monic b monic 
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Using the analytic theory of Drinfeld modules which we sketched in 
Chapter 13 we can make the analogy even more precise. The Carlitz mod- 
ule is associated to a rank 1 lattice Aw and a Carlitz exponential function 
eaz(u). Set ec(u) = e~z(u). In the exercises to Chapter 13 we point out 
that the set of P-torsion points for the Carlitz module which lie in C (the 
completion of the algebraic closure of k..) is given by 


{ec(amt/P) |0 < dega < d}U {0}. 


If we take Ap to be ec(7/P), then ogAp = Ca(Ap) = Calec(@/P)) = 
ec (at /P). Hence, we can rewrite 7 once more (considering it as an element 
of C) as follows: 


n= |] eclat/P)/ |] ec(b/P). 
(a/P)=-1 (b/P)=1 
a@ monic b monic 
The relationship of this unit with the expression after the logarithm in 
Theorem 16.1, part b, is now quite striking! 


The final goal of this chapter is to produce a function field analogue of 
the Carlitz-Olsen Theorem, Theorem 16.5. We need a definition. 


Definition. The relative class number, h;,, is defined to be hx,, /hx+. 


As it stands, h> is a rational number, but it is actually an integer. This 
can be shown algebraically, by showing that the mapping from the divisor 
classes of K* to the divisor classes of Ky», induced by extension of divisors, 
is injective. It also follows from Theorem 16.8 since by parts a and b of that 
theorem and the definition of h>, we deduce 


hn = II ( S- x(a)) . 


x odd = a monic 
deg a<M, 
The right-hand side of this equation is an algebraic integer and the left- 
hand side is a rational number. This shows h,, € Z. 
We will again assume m = P a monic irreducible of degree d. As before, 
the advantage of assuming m is prime is that we don’t have to worry about 
conductors. The last equation simplifies to 


hp= I] ( dS x). (9) 


x odd a monic 
deg a<d 
To be precise, the product is over odd characters on (A/PA)*. We will 
come back to this equation shortly. 
Define t = (q4 — 1)/(q — 1). Then t is the size of the set M of monic 
polynomials of degree less than d. For each character w of F* we construct 
at xt matrix Cw) as follows: 


C() = [b(sgn(ab))} . 
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More concretely, write ab = cP +r, where r € A and degr < d. The 
element r cannot be zero since neither a nor 6b is divisible by P. Then, 
sgn(ab) = sgn(r) = the leading coefficient of r. 

The following theorem is the function field version of Theorem 16.5. 
Theorem 16.16. 

hp=+ || detc(y). 
per" 
PFEWo 


Proof. For every non-trivial character w of F* define 


w= |] ( ‘> x(a) . 


X|\px = @ monic 
deg a<d 


Then Equation 9 can be rewritten 


hp = [| hy. (10) 


WFWo 


Fix a character ¢ on (A/PA)* whose restriction to F* is w%. Then every 
character with this property is of the form $y’ where x’ is an even character. 
We’l] return to this in a moment. - 

If % is a non-trivial character on F*, define» : A > C by (a) = 
w(sgn((a)))~! if P +a and 0 otherwise. 

Using this definition and the previous remark we can write 


v= TT ( SS r@d@s@) . (11) 


x’ even a monic 
dega<d 


Of course, with a monic and dega < d the term (a) is equal to 1. So why 
is it there? The point is that the product ¢(a)q(a) defines a function on 
(A/PA)* since both terms depend only on the congruence class of a modulo 
P, and, in fact, it defines a function on (A/PA)*/F* since if a € F* and a 
is monic with dega < d, 


b(aa)b(aa) = o(a)$(a)~(a)~* = $(a) = o(a)b(a) . 


The upshot is that the sums in Equation 11 over all monic a with dega < 
dcan be replaced with sums over (A/PA)*/F*. We then apply the Dedekind 
determinant formula to Equation 11 and deduce 


hy, = + det[¢(ab)b(ab)] = det[$(a)9(b)h(ab)] . 


By elementary properties of the determinant we can factor out $(b) 
from the b-th row and ¢(a) from the a-th column. The result is that 
the determinant is multiplied by (||, a)?, where the product is over all 
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a € (A/PA)*/F*. This product is the product of the elements of order two 
in this group and so its square is the identity. Thus, 


hy = +det[y)(ab)] = + det[y(sgn(ab))~"] = +det C(W") . 


The proof is concluded by substituting this result into Equation 10 and 
noting that ~~! runs through the non-trivial characters of F* as 7 does. 


Theorem 16.16 is taken from Rosen [4], where other similar results are 
proven and some applications to the size of hp are given. As far as I know, 
no one has published anything generalizing Theorem 16.16 although L. Shu 
has a preprint which addresses this problem. 

There is an elaborate theory of the so-called Stickelberger ideal and its 
relation to the relative class number. We have not discussed this circle of 
ideas. The interested reader may wish to consult Iwasawa [1] and Sinnott 
[1] for the case of cyclotomic fields. The (very general) function field case 
has been dealt with in Yin [3]. In a paper which is to appear, Yin [4], gives 
a new definition of the Stickelberger ideal in the function field case which 
enables him to deal with the class number itself and not just the relative 
class number. 


Exercises 


1. Prove Lemma 16.11 in the general case. More specifically, if m = 
P* is a prime power, show that the group of cyclotomic units in 
Kp: is generated by F* and the set {ogAp:/Ap: | a monic, (a, P) = 
1, dega < tdeg P}. 


2. Generalize the statement of Theorem 16.12 to the case where m = P® 
is a prime power and prove it. Hint: Prove first that a character of 
Gal(K p:/k) has conductor P® if s is the smallest power of P such 
that x(a) = 1 for all a = 1 (mod P%). The reader may wish to 
consult the proof of the classical case. See, for example, Chapter 8 of 
Washington [1]. 


3. Let P be a monic irreducible of even degree. In the proof of Theorem 
16.15 it is claimed that 


hk = Ss” —dega (a/P). 


a monic 
deg a<xdeg P 


Prove this formula. 


4. There is a natural map from Cl,4 — Clx,, induced by extension 
of divisors. Show this map is one to one, thereby giving a new proof 
that hy,+ divides hx,,. Hint: Assume D is a divisor of K7, and that 
ix,,/K#D = (a) for some a € K7,. Show at! =a € Kf, and deduce 
Kk; (a)/K;, is unramified everywhere. | 
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. Let P be a monic irreducible of degree d in F[T]. Since Gal(Kp/k) 
is cyclic of degree g? — 1, there is a unique subfield L C Kp such 
that [L : k] = q—1. Show that L = k(*WP) if d is even, and 
L=k(*W—P) if d is odd. 


. (Continued) Show that L C K if and only if ¢—1 | d. 
. (Continued) More generally, show [LN K3 : k] = (d,q—1). 


. (Continued) Let L+ = LN K. Derive analytic class number formulas 
for hyp+ and hy and show h;+ divides hy. 


17 


Average Value ‘Theorems 
in Function Fields 


In Chapter 2 we touched upon the subject of average value theorems in 
A = F[T]. The technique which we used goes back to Carlitz who as- 
sociated certain Dirichlet series with some of the basic number-theoretic 
functions and then expressed these Dirichlet series in terms of ¢,4(s). The 
zeta function is so simple in the case of the polynomial ring that it was 
possible to arrive at very precise results for the average values in question. 
For example, for n € A define d(n) to be the number of monic divisors of 
n. Then we showed 


YS) d(n)=qX(N+1). 
degn=N 


The corresponding classical result goes as follows. For n € Z, let d(n) 
denote the number of positive divisors of n. Then 


S > d(n) = xlogr + (2y—1)t+ O(y2) - 


l<n<zx 


The constant + is Euler’s constant. 

The relation of the two results is made clearer if one recalls that the size 
of a non-zero polynomial n of degree N was defined to be |n| = q’. Setting 
z= q’, the first equation can be rewritten 


S- d(n) = xlog,(z) +2, 


m monic 
|nj=x 
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which makes the analogy much clearer. 

In the first part of this chapter we consider average values of the general- 
izations of some elementary number-theoretic functions to global function 
fields. Everything becomes a little more complicated, as will be seen. T’he 
work is made a lot easier through the use of a function-field version of the 
famous Wiener-Ikehara Tauberian theorem. The proof of the function-field 
version of this theorem is relatively simple, being an application of the 
Cauchy integral formula. The idea behind this is due to Jeff Hoffstein. 

In the second part of the chapter we work over A = FIT] once again, 
but we consider average values of a “not so elementary” number-theoretic 
function. Namely, to each non-square polynomial m € A we consider the 
order O,, = A+ AV/m C k(./m) and its class number h,, = |Pic(Om)|. We 
will average these class numbers in various ways, thereby obtaining analo- 
gies to two famous conjectures of Gauss. We will discuss this connection as 
well as possible variants and generalizations. 


Let K/F be an algebraic function field with field of constants F with 
|F| = gq. We could set aside a few prime divisors, S, the primes “at infinity,” 
and work with the ring A of functions whose only poles lie in S. Our 
functions would then be defined on the ideals of A. Instead, we will work 
with functions on the semigroup of all effective divisors. Everything we do 
can be extended to the former situation without much difficulty. 

Let Dx be the group of divisors of K and Df be the sub-semigroup of 
effective divisors. We explicitly include the zero divisor as an element of 
Dr. Let f : Dk > C be a function and define 


G(sy= yo A, Q) 


the Dirichlet series associated to f. 

Since the use of the variable s will cause no confusion in this chapter, 
we go back to using s for the variable instead of w. Also, when we use D 
as a summation variable, it will be assumed that the sum is over D in DE 
with, perhaps, some other restrictions. 

For N > 0 an integer, define F(N) = )iaeg p=n f(D). Equation 1 can 
be rewritten 


Cs(s) = DY FIN)”. 
N=0 
Finally, define Z(u) as the function for which Z;(q~*) = ¢p(s). Then 
Z;(u) = 5° F(N)uN . (2) 
N=0 


In Chapter 5 we investigated the function by (K), the number of effective 
divisors of K with degree N. We showed that if N > 2g —2 (where g is the 
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genus of K) 

by(K) = he + 

n(K) =hx JIT 
Definition. Let f : DE — C be a function. The average value of f is 
defined to be 


Ave(f) Ddeg D=N f(D) — jj F(N) 


= lim = lum 
N->00 deg D=N 1 N—0o by (K) 
provided the limit exists. 


This definition is certainly the right one in the current context. In many 
interesting cases, the limit doesn’t exist. In that case the task is to find 
some simple formula for F'(N), or a simple formula plus an error term. We 
will give a number of examples after proving the next theorem which is 
the function field version of the Wiener-Ikehara Tauberian theorem. The 
original theorem is much more difficult to prove. See Lang [5]. A little later 
we will give a function field version of greater generality. 

Before stating the theorem we have to establish a convention which will 
be used throughout the remainder of this chapter. The function q~° is easily 
seen to be periodic with period 277/log(q). The same therefore applies to 
all functions of g~* such as our functions ¢+(s). For this reason, nothing is 
lost by confining our attention to the region 

B={sec|--— <a) < 2a}. 
log(q) log(q) 

In what follows, we will always suppose that s is confined to the region 
B. This makes life a lot easier. For example, ¢x(s) has two simple poles, 
one at s = 1 and one at s = 0 if s is confined to B, but it has infinitely 
many poles on the lines #(s) = 1 and R(s) = 0 if s is not so confined. 


Theorem 17.1. Let f : Dk - C be given and suppose ¢f(s) converges 
absolutely for R(s) > 1 and is holomorphic on {s € B | R(s) = 1} except 
for a simple pole at s =1 with residue a. Then, there is ad <1 such that 


S| f(D) = alog(q)q% +O(@"%) . 
deg D=N 


If Cf(s) — -% is holomorphic in R(s) > 6’, then the error term can be 


s—1l 


replaced with O(g° N \, 


Proof. The hypothesis implies that Z;(u) is holomorphic on the disk {u a 
C | |u| < q-+} with the exception of a simple pole at u = q~+ (just use the 
transformation s > u = q~*). What is the residue of Z;(u) at u = q7'? 
The answer is given by 

—s —l 
q°—4q log(q 
TAT (5 —1)64(s) = - 8 


—— me 


‘ _ yvl —}; 
im tu d )Z5(u) im s—1l qd 
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Next, notice that since the circle {u € C | |u| = q~+} is compact, there 
is a d <1 such that Z;(u) is holomorphic on the disk {u € C | |u| < q~°} 
except for the simple pole at u = q~!. Let C be the boundary of this disk 
oriented counterclockwise and let C, be a small disc about the origin of 
radius « < q~!. Orient C, clockwise, and consider the integral 


1 Z5(u) 
N+1 


271 Jo.+c U 


By the Cauchy integral formula, this equals to sum of the residues of 
Z;(u)u-“—! between the two circles. There is only one pole at u = qu! 
and the residue there is 


—-——a 


q 


lo 
BUD) 5 gh} = —allog(a)aN 


On the other hand, using the power series expansion of Z;(u) about 
u = 0, we see 

1 Z(u) 

ori c, UNF 


du=-—F(N). 


It follows that 


1 Z+(u) 
_ N, 4. f 
F(N) = alog(q)q” + sah NG du 


Let M be the maximum value of |Z;(u)| on the circle C’. The integral 
in the last formula is bounded by Mq*°™ , which completes the proof of the 
first assertion of the theorem. 

To prove the last part, we may assume 6’ < 1 since otherwise the error 
term would be the same size or bigger than the main term. If ¢r(s) — 
a/(s — 1) is holomorphic for R(s) > 6’, then Z;(u) is holomorphic on the 
disc {u € C | |u| < q7® } except for a simple pole at u = q~!. In that case 
we can repeat the above proof with the role of the circle C' being replaced 
by the circle C’ = {ue C | |u| = q7*}. The result follows. 


We illustrate the use of this theorem by investigating the generalization 
of the question: what is the probability that a polynomial is square-free? 
In Chapter 2 we showed, after making the question more precise, that the 
answer is 1/¢,(2). 

What would it mean for a divisor to be square-free? This is initially 
confusing, but only because we write the group law for divisors additively. 
A moment’s reflection shows that the following to be the right definition. 
An effective divisor D is square-free if and only if ordpD is either 0 or 1 
for all prime divisors P, i.e., if and only if D is a sum of distinct prime 
divisors. 
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Proposition 17.2. Let f : Dt —> C be the characteristic function of the 
square-free effective divisors. Then F(N) = digegp=n [(D) ts the number 
of square-free effective divisors of degree N. Given € > 0, we have 


1 _ hk 
PN) = Ca) Hq = 1) 
Moreover, Ave(f) = 1/€K(2). 


Proof. Recall that for divisors C and D we have N(C+ D) = NC ND. 
From this we calculate 


f( (D) 1 ¢ (s) 
Cr(s) = ~ 2 NDe = ~~ » NDs — =I(+ Bs) > (28) ) 


D square—free 


gh +0.(q4tO%) | 


By the function-field Riemann Hypothesis we know that all the zeros 
of Cx (s) are on the line R(s) = 5. Thus 1/¢x(2s) has no poles in the 
region R(s) > 4. On the other hand, we know that in this region ¢x(s) is 
holomorphic except for a simple pole at s = 1. 

Choose an € > 0 and set 6’ = 4 +e, Then all the hypotheses of Theorem 
17.1 apply to ¢y(s) and we find 


F(N) = alog(q)q™ + O- (qt) , (3) 


where a is the residue of (x (s)/CK (2s) at s = 1. We have seen in Chapter 
5 that the residue of Cx (s) at s = 1 is 


hr 
q9-*(q — 1) log(q) | 


It follows that a = px /¢x(2). Substituting this information into Equa- 
tion 3 completes the proof of the first assertion of the proposition. 

To prove the second assertion recall that Ave(f) = limyj0 F(N)/bn(K) 
and that for all N > 2g —2, by(K) =hx(q¥~9t! —1)/(q—1). By the first 
part of the proposition we find, for N in this range, 


(4) 


pK = 


F(N) 1 @N-97! ~34e6)N 
by (K) = Gea ghost wy tcl ) . 


Now, simply pass to the limit as N tends to oo. 


As a second example, we generalize the function o(n), the sum of the 
divisors of n. If D is an effective divisor, what is a divisor of D ? A little 
thought leads to the following definition; C is a divisor of D if and only if 
D—C > 0. D has only finitely many effective divisors in this sense and we 


define o(D) to be 
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The reader should not have trouble being convinced that this is a sensible 
generalization of the usual “sum of the size of divisors” function. 

The Dirichlet series ¢,(s) is equal to ¢x(s)CxK(s — 1), as can be verified 
by the following calculation: 


Cic(s) (8 ~ 1) = ( > wa) (2 war) =D  Nc)ND 


B+C=D 


Proposition 17.3. Leto : DE —> C be the sum of norms of divisors 
function defined above. Given ane > 0, we have 


S> o(D) = Cie (2) eg? + 0 (g4O¥) | 


—]1 _ 
deg Do=N q9-}(q—1) 


Proof. Since ¢,(s) = ¢x(s)¢xK(s — 1), it has a pole at s = 2, a double pole 
at s = 1, and a pole at s = 0. The conditions of Theorem 17.1 do not hold! 
However, we can make progress by substituting s +1 for s. This yields 
Co(s +1) = Cx (s+ 1)¢x(s). This function has a simple pole at s = 1 and 
is otherwise holomorphic on the region #(s) > 0. Choose an € > 0 and set 
6’ = € in Theorem 17.1. We have ¢,(s + 1) is holomorphic on the region 
K(s) > € except for a simple pole at s = 1 with residue ¢€x(2)px. 

We are all set to apply Theorem 17.1, except that we need the expansion 
of ¢,(s + 1) as a power series in q~* = u. This is easy, 


ms sae ->( d iD)! ” 


N=0 deg D=N 


Y(r" S- o(D))u | 


deg D=N 


Co(s + 1) 


| 


It follows that 


g7N S° a (2) px log(q )q NV +40,.(q°%) 


deg D=N 


Multiply both sides of this equation by g™ and use the explicit expression 
for px given by Equation 4. This finishes the proof. 


It is amusing to carry matters a step further. Divide both sides of the 
equation in the proposition by by(K) and use the reasoning at the end of 
the proof of Proposition 17.2. We find that the average of o(D) among all 
effective divisors of degree N is approximately ¢%(2)q’. 


As a final application of these methods we want to investigate the func- 
tion d(D), the number of effective divisors of D. More precisely, d(D) = 
#{CEDE|O<C< Dt}. 
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It is relatively easy to check that ¢y(s) = Cx(s)?. This function has a 
double pole at s = 1 so Theorem 17.1 doesn’t immediately apply. Moreover, 
it is hard to imagine any simple trick reducing us to the conditions of that 
Theorem. What is needed is a generalization. This is provided by the next 
result. 


Theorem 17.4. Let f : DE —+ C and let ¢y(s) be the corresponding Dirich- 
let series. Suppose this series converges absolutely in the region R(s) > 1 
and ts holomorphic in the region {s € B | R(s) = 1} except for a pole of 
order r ats =1. Let a =lims_,1(s — 1)"C¢(s). Then, there is a5 <1 and 
constants c_; with 1 <i<r such that 


rM= YO sW)=8 (LeU EY )ca’) +0). 


deg D=N 


The sum in parenthesis is a polynomial in N of degree r—1 with leading 
term 

] r 

og(q) Ov Nv7} 

(r —1)! 


Proof. As in the proof of Theorem 17.1, we can find a 6 < 1 such that 
Z+(u) is holomorphic on the disc {u € C | |u| < q~°}. We again let C be the 
boundary of this disc oriented counterclockwise and C. a small circle about 
s = 0 oriented clockwise. By the Cauchy integral theorem, the integral 


is equal to the sum of the residues of the function Zs(u)u~ ~! in the region 
between the two circles. There is only one pole in this region. It is located 
at_u = q7!. To find the residue there, we expand both Z p(u) and u~“—? in 
Laurent series about u = q~+, multiply the results together, and pick out 
the coefficient of (u—q~')~?. 

By using the Taylor series formula or the general binomial expansion 
theorem we find 


_N- — (-N-l\ , “15 
u-N baa ( ; Jeu ‘) 
j=0 


The Laurent series for Z(u) has the form 


with c_, 4 0. 
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Multiplying these two series together and isolating the coefficient of 
(u —q~*)~! in the result yields 


— N-1 
—~N- N oa ~i-1 
Res,-g-1Z7(uju- 71 = ght? ) (TTY) 


| 
en) 
= 
> 
o) 
J 
———~ 
= 
. | 
NY 
Q, 


To get the last equality we simply transformed i to —7 and redistributed 
one factor of q. 

It is easy to see that (~*~) = (-1)* (NR so the residue can be rewrit- 
ten as 


As in the proof of Theorem 17.1, it now follows that 
- N+i-1 , 
_ _N S } _g)*) 4 6N 


Finally, we must prove the assertions about the term in parenthesis. First 
of all, it is clear that when k > 0, (” th) is a polynomial in N of degree 
k, and that its leading term is k!-!N k . Thus the sum in parenthesis is a 
polynomial in N of degree r — 1 and its leading term is 


Cor 


(r—1)! 


It remains to relate a = lims_,1(s — 1)"¢¢(s) to c_,. This relationship 
follows from the calculation 


(—q)" NT! 


cry = lim (u—q7')"Z;(u) 


as (=) (s —1)"¢r(s) = (-=22) a 


Substitute this expression for c_, into the previous expression for the 
leading term of the sum in parentheses and we arrive at 


log(q)" 
(r—1)! 


for the leading term. This completes the proof. 


| 
=) 


a Nv7} 


Corollary. With the assumptions and notation of the theorem, we have, 
as N —> oo, 
log(q)” 


F(N) ~ (r—-DI g’ NT-} . 


(Here, “~” means “is asymptotic to”). 
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In order not to clutter the statement of Theorem 17.4, we did not in- 
clude a refinement similar to the last part of the statement of Theorem 
17.1. However, it is easy to establish the following generalization in the 
present case. Suppose the hypotheses of Theorem 17.4 hold and that P(s) = 
)_,-1 @-i(s—1)~* is the polar part of the Laurent expansion of ¢7(s) about 
s = 1. If ¢¢(s) — P(s) is holomorphic in R(s) > 6’, then the error term can 
be replaced with O(go N ). The proof is the same as in the earlier situation. 


For general Dirichlet series there is a generalization of the Wiener-Ikehara 
Tauberian theorem, which is analogous to Theorem 17.4. It is due to H. De- 
lange [1]. A statement of the theorem is given in Appendix II of Narkiewicz 
(1). 

We now want to apply Theorem 17.4 to the divisor function d(D) on 
Dt. 

Proposition 17.5. Let K/F be a global function field and d(D) the divisor 


function on the effective divisors. Then, there exist constants ux and AK 
such that for fixed € > 0 we have 


S> d(D) = q% (AkN +x) + O.(q°%) . 
deg D=N 


More explicitly, AK = h?.q?~*9(q —1)~?. 


Proof. We have already seen that ¢4(s) = Cx (s)*, a function which has a 
double pole at s = 1 and is otherwise holomorphic for #(s) > 0. Choose 
e > 0. Notice that lim,_,1(s —1)*¢x(s)* = p%. Applying Theorem 17.4 and 
the remarks given after that theorem we find there are constants Ax and 
tux such that 


S> d(D) = q™ (AkN+ux) + O-(9°%) . 
deg D=N 


Applying the formula for the leading term of the polynomial in paren- 
thesis given in the statement of Theorem 17.4, we find 


= 28g" hi 


log(q)” _ log(q)? 2 
AK (r — i ~ ap PK q?9-2(q — 1)? © 


This finishes the proof. 


Another interesting fact is that the average value of d(D) over the effec- 
tive divisors of degree N is asymptotic to 


hk 
q9-*(q—1) 
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This is easy to establish on the basis of the proposition. 

We have merely touched on the fringes of a large subject. The reader who 
wishes to explore this area further should consult the book of J. Knopf- 
macher [1]. We will not pursue these matters here. Instead, we turn to 
another topic in the area of average values of arithmetic functions. 


In his famous work Disquisitiones Mathematicae, C.F. Gauss considered 
at length the arithmetic of binary quadratic forms ax? + 2bry + cy” defined 
over the integers Z. The discriminant of such a form is by definition D = 
4b* —4ac (because of the restriction that the coefficient of ry be even, Gauss 
considered only even discriminants). He defined an equivalence between 
such forms and showed that equivalent forms have the same discriminant. 
Moreover, he showed that the number of equivalence classes of forms with 
the same discriminant is finite. Call that number hp. Based on extensive 
numerical evidence he made two conjectures about the average value of 
these class numbers jp. Slightly reformulated, they read as follows. 


1. Let D = —4k vary over all negative even discriminants with 1 <k < 
N. Then 4 
TV 3 
S> hp~ NE. 
open 21¢(3) 


2. Let D = 4k vary over all positive even discriminants such that 1 < 


k<N. Then 


Se) fe) 


Ar? 
S- hpRp ~ —— N 
1<keN 21¢(3) 


The number Rp in the second conjecture is closely related to the regula- 
tor of the real quadratic number field Q(/D). In fact, the both conjectures 
can be reformulated in terms of orders © in quadratic number fields where 
the class numbers / are interpreted in terms of the size of the Picard group 
of O, Pic(O), i.e., invertible fractional ideals of O modulo principal ideals. 

Both of these conjectures have been proven. There is a long history. The 
interested reader can find a brief review of all this in Hoffstein-Rosen [1]. 

We will consider the function field analogue of Gauss’s conjectures. As 
usual, instead of Z and Q we consider the pair A = F(T] and k = F(T). 
For the remainder of the chapter, we assume that the characteristic of F is 
odd. Let m € A be any non-square polynomial, and consider the quadratic 
function field K = k(,/m). Write m = mom?, where mo is square-free. The 
polynomial mo is well defined up to the square of a constant. Define O,, to 
be the ring A+ A,/m c K. It is an A-order in K, i.e., it is a ring, finitely 
generated as an A-module, and its quotient field is K. 


Proposition 17.6. With the notations introduced above, the integral clo- 
sure of Ain K 1s Om,. The ring Om ts a subring of Om, and the polynomial 
my, 18s a generator of the annihilator of the A-module O,,/Om. Finally, if 
O is any A-order in K, then O = O,, for some m€ A. 
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Proof. Since the characteristic of F is odd, it is easy to see that K/k is a 
Galois extension. Let o generate the Galois group. 

Clearly, K = k(./m) = k(,/mo). Every element in K has the form r + 
s,/mo for suitable r,s € k. The automorphism o takes Mo to —/mMo. 

Suppose r+ s,/mo is integral over A. Applying o we see that r—s,/mo is 
also integral over A. Thus, so is the sum and product of these two elements, 
i.e. 2r and r* — mos? are integral over A. Since A is integrally closed, we 
have 2r € A and r? — mos? € A. We may divide by 2, so r € A and it 
follows that mos* € A. Since mo is square-free, we must have s € A. We 
have proved that if r+s,/mo is integral over A, then r+s,/mo € A+A,/mMo. 
The converse is clear, so our first assertion is established. 

From the definitions, O,, = A+ A\V/m = A+ Ami\/mo C A+ Amo = 


O,. It is then immediate that as A-modules 
Om, /Om = A/m,A , 


which proves the second assertion. 

Let O be any A-order in K. One can easily show that every element of 
O is integral over A. Since 1 € O by definition, we have A C O. Since K is 
the quotient field of O there is an element a € O such that a ¢ A. By the 
first part of the proof we can write a = a+ b,/mo with a,b € A and b 40. 
It follows that b,/mo € O with b € A — {0}. Choose m; € A — {0} to bea 
non-zero polynomial of least degree such that m1,/mo € O. Set m = mom?. 
We claim that O = O,,. It is clear that O,, C O, so we must show the 
reverse inclusion. Suppose a + b,/mo € O with a,b € A. By the division 
algorithm in A we can write b = cm; +r, where c,r € A and either r = 0 
or degr < deg m . Multiply this relation on the right by ,/mo and we can 
deduce that r,/mo € O. Since m, is a non-zero polynomial of least degree 
with this property, we conclude that r = 0. Thus, a = a+ cm ,,/mo € On, 
and we are done. 


Definition. Let m € A, m not a square, and let O,, C k(,/m) be the 
A-order described above. Pic(O,,), the Picard group of Om, is the group 
of invertible fractional ideals of O,, modulo the subgroup of principal frac- 
tional ideals. The class number h,,, is defined to be the cardinality of this 
group (we will see shortly that h,, is finite). 


If mo is square-free, then as we have just seen, O,,, is the integral 
closure of A in K = k(,/mo). In this case, Om, is a Dedekind domain, 
Pic(Om ) = Cl(Om,), the class group of O,,,, and hm, is the usual class 
number. Moreover, hm, is finite by Proposition 14.2 (take S to be the 
primes of K lying above oo). 

Before going further with this analysis, we need another definition. If 
m € A, m a non-square, define y,(a) as follows: 
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Recall that if P is irreducible, then ym(P) = 0 if P|m, and if P 7m then 
Xm(P) = 1 if m is a square modulo P and —1 otherwise. If a is a product 
of irreducibles, one extends y,,(P) by multiplicativity; i.e., ifa = The: P,, 
then Xm(a) = [Tiz1 Xm(Pi). 

If m = mom? we have Xm(a) = Xm,(@) whenever (a,m) = 1. However, if 
P is an irreducible such that P|m, and P + mo, then we have xm(P) = 0, 
whereas V¥m,(P) #4 0. 

Define L(s, Vm) as follows: 


Notice that if m = mom?, we have 


L(s,xm) = |] (1 - en) L(8,Xmo) - 


P|m, 


When ™ is square-free, the next proposition shows that L(s,xm) is 
closely related to the Artin L-function associated to the abelian extension 


k(/m) /k. 


Proposition 17.7. Suppose m is square-free. Consider the quadratic ex- 
tension K = k(./m) of k. Let Loo(s8,X%m) be 1 if oo ts ramified in K, 
(1—q7*)7! if co splits in K, and (1+ q7%)~} if 00 is inert in K. Then 


Leo ($; Xm) LS; Xm) 


is the Artin L-function associated to the unique non-trivial character of 


Gal(K/k). 


Proof. We have seen that A+ A,/m is the integral closure of A in K. The 
discriminant of this ring over A is 4m. Since 4 is a non-zero constant, a 
prime P of A is ramified if and only if it divides m. 

Let L(s,x) be the Artin L-function associated to the unique non-trivial 
character y of Gal(K/k). If P is a finite prime, x(P) = 0 if P is ramified 
and x(P) = x((P, K/k)) if P is unramified. By the definition of the Artin 
symbol, (P, K/k) is e if P splits, and o if P is inert (o being the non-trivial 
element of the Galois group). Thus, y(P) = 1 if P splits and y(P) = —1 if 
P is inert. By the decomposition law in quadratic extensions (take / = 2 in 
Proposition 10.5), P splits in K if and only if ym(P) = 1. Thus, for finite 
primes x(P) = x¥m(P). At oo we know that |oo| = q so (1 — x(co)q78)7! 
is 1 if oo is ramified, (1 —q7*)7! if oo splits, and (1+ q7*)~' if oo is inert. 
Thus, (1 — x(00)|o0|~%)~1 = Loo(s, Xm). We have shown that D(s,y) and 
Doo(8,Xm)L(S, Xm) have the same Euler factors for all primes. Thus, they 
are equal. 
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We are now in a position to state the connection between L(1, xm) and 
class numbers. We begin with the case of m square-free. This relation is 
proven in the thesis of E. Artin (see Artin [1]). We will show, in a little 
while, how to generalize this result to the case of non-square polynomials 
Mm. 
If m € A, recall the definition of sgn.(m). This is 1 if the leading coefh- 
cient of m is a square in F* and is —1 if it is not. 


Theorem 17.8A. Let m € A be a square-free polynomial of degree M. 
Then, 


1) If M is odd, L(1,x%m) = aia Pan: 


2) If M is even and sgno(m) = —1, L(1,xm) = ae hm. 


3) If M is even and sgno(m) = 1, L(1,x%m) = 4 hmRm. 


V\ml 


Here, Rm is the regulator of the ring Om. 


Proof. Set K = k(,/m). From Proposition 17.7 and Proposition 14.9 we 
derive 


CK(s) = Ck (8) Loo (8, Xm) L(8, Xm) 


Multiply both sides of this equation by s — 1 and take the limit as s > 1. 
One finds 


hx a 
q9~1(q — 1) log(q) 7 q*(q — 1) log(q) Loo(l, Xm)L(1; Xm) ' 


Simplifying, we obtain 


heq 9 = Loo(1,Xm)L(1, Xm) - (5) 


Proposition 10.4 and the following remarks show that the genus, g, of K 
is M1 in case 1 and & — 1 in cases 2 and 3. 

Proposition 14.6 shows that in case 1, 00 is ramified, in case 2, oo is inert, 
and in case 3, oo splits. By Proposition 14.7, we find hy, = hk, hm = 2hxr, 
and hy, Rm = hx in cases 1, 2, and 3, respectively. 

~1 


Let’s consider case 1. We have g = “== and L.o(1,Xm) = 1. Also, hm = 


hx. Substituting this information into Equation 5, and noting \/|m| = qz, 


we find 
hum 
my _ 7, 


|m| 


1,Xm) : 


This proves case 1. 


318 Michael Rosen 


To deal with case 2 we note g = u —1, Lo(1,X¥m) = (1+¢q7')7!, and 
hm = 2hx. Substituting into Equation 5 once again, we find 


a — € + 2) 1.x . 


Case 2 of the Proposition is immediate from this. 

The last case, case 3, is done in exactly the same way. Here, g = 4 —1, 
Leo(1, Xm) = (1—q7')7}, and hx = hmRm. Substituting into Equation 5 
one more time yields 


—1 


jm| 


h, 


Case 3 follows easily, and this concludes the proof. 


Let’s now return to the general case. Let m be a non-square polynomial 
and write m = mom? with mo square-free. We will need the relation be- 
tween h,, and h»,,. While this is not a very difficult relationship to find and 
prove, it does take a rather detailed investigation which is off to the side 
of our main purpose. For this reason, we will merely state the result and 
refer to Lang [1], Chapter 8, Theorem 7, where the corresponding result for 
quadratic number fields is proven. The function field version, stated below, 
is proved in exactly the same way. For a general result (for number fields) 
along the same lines the reader may wish to look at Neukirch |1], Chapter 
1, Theorem 12.12. 


Proposition 17.9. Let m € A be a non-square and write m = mpm% with 
Mo square-free. Then, 


CI T] xo (P)IPIP) 


P\m, 


hm — Amo 


Implicit in this result is that the index [O*,, : O%]] is finite. If oo either 
ramifies or is inert, both groups are equal to F* and the index is 1. If co 
splits, then both groups have Z-rank 1 and one can show without much 
difficulty that the index is the same as the quotient of regulators Ry, /Rmyo.- 
We set R,, = Ry, = 1 in the first two cases. Then the relationship given 
by Proposition 17.9 can be rewritten 


hin Bem ~ hee : 
[= xmo(P)IPI“) . (6) 


v Jim V/|mo| P\|m, 


Theorem 17.8B. All the assertions of Theorem 17.8A remain valid if 
me€A is a non-square polynomial. 
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Proof. Suppose m = mom? with mo square-free. From the definitions, 
L(8,xXm) = L(8,xXmo) [[ (1 — xXmo(P)|PI™®) - 
P\|m, 
It follows that Equation 6 can be rewritten as 
AmRm 1 — Amy Rmo 1 
/|m| L(1, xm) /|Mo| L(1, Xmo) ) 


With the help of this equation, we see that Theorem 17.8B follows from 
Theorem 17.8A. 


By Theorem 17.8B, we see that the task of averaging class numbers 
reduces to the task of averaging the numbers L(1, xm). It turns out that it 
is no harder to average L(s, Xm) for any value of s. This is what we shall 


do. 
To begin with, notice that 


L(s,Xm) = Xm(n) = s-( S- Xm(n) )q~® . 
d=0 


m monic 
deg(n)=d 


Definition. For d € Z, d > 0, define 


Sa(xm)= > xXm(n) 


nm monic 
deg(n)=d 


Using this definition, we can rewrite L(s,%m) as ) 5-9 Salxm)q7*. This 
sum is actually finite as we see from the following Lemma. 


Lemma 17.10. [fm ¢ F* is not a square, Sa(Xm) = 0 ford > M = 
deg(m). 


Proof. By the reciprocity law, Theorem 3.5, we have 
(=) (= } = (—1)*F M4gon(m)4 . 


Call the quantity on the right of this equation cg. Then, we have x(n) = 
ca(n/m). Thus, if d > M, 


by the proof of Proposition 4.3. 
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Corollary. [fm ¢ F* is not a square, then 
M-1 
L(s,xXm) = > Sa(xm)q~® , 
d=0 


a polynomial of degree at most M —1 inq~®. 
Proof. This is immediate from the lemma and the previous remarks. 


Our goal is to understand the sums > deg(m)= u L(s,Xm) or the same 
sums restricted to monic polynomials m of degree M. By the corollary we 
are reduced to considering the sums ) Jgeg(m)=m Sd(Xm) where d < M. 

We will need the following definition and the subsequent proposition. 


Definition. Let M and N be non-negative integers and n a monic poly- 
nomial of degree N. Define ®,,(M) to be the number of monic polynomials 
m of degree M such that gcd(n,m) = 1. Define ®(N, M) to be the number 
of pairs (n,m) of monic polynomials such that deg(n) = N, deg(m) = M, 
and gcd(n,m) = 1. 


Note that 
S> On(M) = O(N, M). 


n monic 


deg(n)=N 
Also, it is obvious that ®(N, M) = ®(M,N). 
Proposition 17.11. 6(0,M)=q™ and if M,N >1, then 


®(N,M) = qM@t% € — “| . 


Proof. From the definition, (0, M) is equal to the number of monic poly- 
nomials of degree M which we know is q™. This proves the first assertion. 
To prove the second assertion, call two pairs (n,m) and (n’, m’) equivalent if 
gced(n,m) = gcd(n’,m’). Breaking the set {(n,m) | deg(n) = N, deg(m) = 
M} into equivalence classes and counting leads to the identity 


min(N,M) 
QNtM= SNS” q? &(N-d,M-—d). 


d=0 


Suppose M,N > 1. The proof now proceeds by induction on the number 
M +N. The smallest value this number can have is 2, in which case the 
formula yields q? = ®(1,1) + q&(0,0), or (1,1) = q? —q=q?(1—q7!). 

Now suppose the formula is correct for all pairs N’,M’' > 1 with N’ + 
M’ < N+M. We may also suppose, by symmetry, that N < M. Then 


N—-1 
git% = 6(N,M)+ S— q?®(N —d,M —d)+q%&(0,M—N). 


d=1 
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For 1 <d < N-1 we have 6(N—d, M—d) = q@+N~-24(1—q7') whereas 
by the first part of the proof, ®(0, M — N) = q“—. Substituting into the 
above formula and simplifying slightly, 


gMtNn — = O(N, gMtN y qv 41 —q “1) 49! = ®(N, M)+ gM@tN-l 


The second assertion now follows immediately. 


This elegant proof is due to David Hayes. 
It is convenient to extend the definition of ®(N, M) to half integers by 
defining ®(N/2, M) = Oif N is odd. 


Proposition 17.12. Suppose 1<d< M —1. Then 
S> Sa(xm) =(9-1)7* SS Sa(Xm) = O(d/2,M) . 


m monic deg(m)=M 


deg(m)=M 
Proof. To begin with assume all sums are over monics. Then 
m m 
~ Suxm= YY (F)= Y LY CE). 
deg(m)=M deg(m)=M deg(n)=d deg(n)=d deg(m)=M 


If n is not a square, (*/n) is a non-trivial character modulo n. Thus, in 
this case, since M > degn = d, 


x ba) = 


deg(m)=M 


by the proof of Proposition 4.3. 
Now, suppose that n = n? is a square. Then (m/n) = (m/n1)? = 1 


whenever gced(m,n 1) = 1 and (m/n,)? = 0 otherwise. It follows that 


© = E (F) -00. 


n 
deg(m)=M deg(m)=M 
Thus 


> Sa(Xm) = S- Pn, (M) — &(d/2, M) ' 


deg(m)=M . deg(n1)=d/2 


To do the general case, let a € F* and sum over all am as m runs through 
the monics of degree M. The above calculation shows the answer is again 
equal to &(d/2, M). It follows that if we sum over all polynomials of degree 
d the answer is (q — 1)®(d/2, M). This completes the proof. 


We now have all the information we need to state our main results about 
averages of L-functions. We begin with the easiest case, averaging over all 
monics of fixed odd degree. 
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Theorem 17.13. Let M be odd and positive. We have, for alls € B with 


o#4, 
_M _  Ga(2s) _ _il 1—2s) “¢ 
q 2a L(8, Xm) — C4 (2s + 1) ( -) (q ) Ca(2s) ’ 
deg(m)=M 


hors = 5 we have 


a> L(1/2,xm) =1+ (1-2) (“S) | 


m monic 


deg(m)=M 


Proof. By the corollary to Lemma 17.10, L(s, ym) = a Sa(Xm)q7%. 
From this, Proposition 17.11 and Proposition 17.12, we find 


» L(s,xm) = > ( S- Sa(xm) )a~ 
d=0 


m monic m monic 


deg(m)=M deg(m)=M 


= M+ &(1, M)q-* + &(2, M)gu* +--+ &((M —1)/2, M)g~—9" 


= qM ( + € ~ =) fgi-?* + (gh)? et (gi) . 


The result for s = ‘ follows from this by substitution. For s 4 5 we sum 
the geometric series to derive 


1 1 — (gi?8) " 
—~M _ 1-2 
q L(s,xm) =14 (1-2) 4 a 
m monic 
deg(m)=M 
1 qi~?s 1 126, MH 
= | 1 — — | —*——— - [(1-- °\ 2 2s). 
+( :) 1 — gi~*s ( aK y 8 Gas) 


We have used the fact that ¢4(s) = (1—q!~*)71, a fact we will use again 
almost immediately. 

A close look at the last line shows that it only remains to identify the 
sum of the first two terms with a quotient of zeta values. This follows from 
the calculation 


i4(,-1) 2" yy eee _ te e™® as) 
q/ 1-—- qi-2s _ qi-2s 1— qi-2s C4(28 + 1) ; 
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Corollary 1. If R(s) > 5, then 


_M 3 Ca(28) 
L 4s 
, m monic \s Xm} ~ Ca(28 + 1)’ 
deg(m)=M 


as M —> oo through odd values. 


Proof. This follows immediately from the theorem together with the ob- 
servation that if R(s) > 4 then |gq!~75| < 1. 


Corollary 2. If M is odd and positive, then 


—M S- h, = Ca(2) Ma —1 


q m Ca(3) q 


m monic 


deg(m)=M 


Proof. We begin by substituting s = 1 into the identity given in the 
theorem. We find 


rE His BB (2) rte Boe 


m monic 

deg(m)=M 
The last equality follows from ¢4(2) = (1—q7')7}. 

By Theorem 17.8A and Theorem 17.8B, we see that L(1,%m) = hm 


VG _ 
, |m| 

—1 . . 
hmq 2 . Substituting this information into the last equation yields the 
result. 


We remark that in Theorem 17.13 and the corollaries we could have 
averaged over all polynomials of odd degree M instead of the monics of that 
degree and the result would be the same. This is implied by Proposition 
17.12. We leave the details to the reader. 

We are left with consideration of the two cases where deg(m) = M is 
even and the leading coefficient of m is either a square in F* or a non- 
square. These cases are complicated by the possibility of m being itself a 
square or a constant times a square. In these cases, k(,/m) is either equal 
to k or is a constant field extension. We wish to exclude both possibilities. 
This can be done without much difficulty, but the calculations are more 
involved. We will be content with stating the results in these cases and 
referring the reader to Hoffstein-Rosen [1], Section 1, for the proofs. 


Theorem 17.14. Let M be even and positive. The following sums are over 
all non-square monic polynomials of degree M. 


(1) Suppose s # $ or 1. Then 


g™“S—L(s,Xm) = ae - ( - 7 (q'~?8)¥ C4 (2s) 
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_q-¥ (a _ € _ *) a) "cals) ) . 


Ca(2s + 1) q 
(2) For s=1 we have 


DH) Le (oe (1-2)ar-n). 


Corollary 1. If Re(s) > 5 , then as M — oo though even integers, 


_ Ca(2s) 


Corollary 2. With the hypotheses of the theorem, we have 
2) mM 1 
MS himBm = (q—1)7? (ise % (2+ (2-2) M-1))) | 


Nothing mysterious happens at s = 7 We leave the evaluation of these 
averages at s = 5 to the exercises. 


We now state the result in the remaining case. 


Theorem 17.15. Let M be positive and even, and let y € F* be a non- 
square constant. The following sum is over all non-square monic polyno- 
mials of degree M. Fors # 5 we have 


q ™ S> L(s, Xym) = 


Be. ( _ | 1-28) Mf _a-¥ poo fe -( -+) ca 
Ga(as+i) \~ q (Dey Aa@)—a* \ Tams bg) rpg) 
Corollary 1. [f R(s) > 5 then as M — o6 through even integers, 


_ Ca(2s) 
q MY = L(s, xm) ~ CES . 


Corollary 2. With the hypotheses of the theorem, we have 


GSO ham = 2(q +1)! (S04 -1- r) . 


There is a question that could be asked about all the occurrences of 
expressions involving the zeta function, ¢4(s), which occur in these last 
few theorems. After all, all these expressions are simple rational functions 
of q~* that can be written down explicitly. We have maintained the zeta 
function notation for two reasons. First, it makes the analogy with average 
value results in the number field case more striking. Secondly, consider the 
following research project. Fix a global function field k other than F(T) as 
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base field and fix a ring of S-integers A in k. Investigate average value of 
class numbers of the integral closure of A in quadratic extensions of k. This 
would all have to be formulated more exactly, but after making everything 
precise it is fairly clear that special values of ¢,4(s) will appear in the answer. 
In this general situation, the zeta function ¢,4(s) is no longer as simple as in 
the rational function field case, nevertheless it will play a similar structural 
role. Thus, it seems reasonable to phrase the more elementary results in 
the way we have done it. 

We want to conclude this chapter by mentioning a number of refinements 
and generalizations of the above results on class numbers. 

The first refinement is to consider only polynomials m that are square- 
free. In this case, Om is the integral closure of A = F{[T] in K = k(./m). 
Thus the class numbers h,, are similar to the class numbers associated 
to quadratic number fields. In the language of binary quadratic forms, 
we would be restricting consideration to forms with fundamental discrimi- 
nants. Averaging in this case is surprisingly difficult. In Hoffstein-Rosen [1], 
the task is accomplished with the help of functions defined on the meta- 
plectic two-fold cover of GL(2, k..), where ko is the completion of k at the 
prime at infinity. 


Definition. For s € C, R(s) > %, define 
o(s) = | JQ —|Pi-? — [Pp eet) + Pps?) 
P 


It is easy to see the product converges uniformly and absolutely in the 
region under consideration. 


For simplicity we state the next theorem for the region R(s) > 1. The 
full result concerns the region R(s) > 3. 


Theorem 17.16. Let « > 0 be given and assume s € C with R(s) > 1. 


(1) If M = 2n+1 is odd, then 
(q—1)7*(gM —q¥1)? $9 L(8, Xm) = 6a(2)6a(28)e(s)+O(g *O~) , 
where the sum is over all square-free m such that deg(m) = M. 

(2) If M = 2n is even, then 


271 (q—1)71(q™4 —g4 7")? S> Ls, xm) = C4(2)6a(28)e(s) +O(Q *O-) 


where the sum is over all square-free m such that deg(m) = M 
and sgno(m) = 1, or over all square-free m with deg(m) = M and 
sgn,(m) = —1. 
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The problem of working out the generalization of these results to the 
case of arbitrary global function fields as base field has been solved by B. 
Fisher and S. Friedberg. They use a new technique of “double Dirichlet 
series.” ‘Their paper should appear soon. 

Motivated by questions about rank 2 Drinfeld modules on A = F[T] with 
complex multiplication by orders in quadratic extensions of k = F(T), D. 
Hayes has formulated average value results about degrees of “minimal ide- 
als” and j-invariants which refine Theorem 17.13. Results and conjectures 
in the case of all discriminants have been published in Hayes [7]. Using The- 
orem 17.16, he and his former student, Z. Chen, have also treated the case 
of averaging over fundamental discriminants. This has not yet appeared. 

Finally, we point out that one can move beyond the consideration of 
quadratic extensions. Let | be a prime dividing gq — 1. Then, F* contains a 
primitive /-th root of unity. It follows that every cyclic extension of degree 
l of k = F(T) is obtained by adjoining an /-th root of a polynomial in 
A. One can develop a theory of orders in such an extension and try to 
build a theory of average values of class numbers similar to what we have 
seen in the case of quadratic extensions. This was done in Rosen [3]. Even 
in the case where all discriminants are under consideration, the averaging 
process becomes more difficult. In this paper the case | = 3 is treated in 
detail. Later, in her Brown University Ph.D. thesis, G. Menochi was able 
to handle the case | = 5. The complication increases at each step. To get 
a completely general result looks out of reach without new methods. At 
this time, no one has attempted the task of averaging class numbers over 
fundamental discriminants when | > 2. 


Exercises 
1. Suppose f : Di — C and that Ave(f) exists in the sense defined at 
the beginning of this Chapter. Show that 


D 
Ave(f) = lim 27see2sn tt?) 
N—-+00 deg D<N I 


2. Let D = 5° a(P)P be an effective divisor of K. Let m > 0 be a 
positive integer We say that D is m-th power free if for each P, 
a(P) #0 implies m+ a(P). Let fm be the characteristic function of 
the m-th power free divisors. Show Ave(fm) = ¢«(m)71. 


3. Let m > 0 be a positive integer and D an effective divisor. Define 


Om(D)= S> NC™. 
0<CK<D 


Find an asymptotic formula for S,,(N) = D deg p=n Om(D). 
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. Let D =) a(P)P be an effective divisor of K. Define u(D) = 1 if D 
is the zero divisor, 4(D) = (—1)* if D is square-free and exactly t of 
the coefficients a(P) are not zero, and u(D) = 0 otherwise. For every 
fixed « > 0, show 


S> m(D) = Oto) . 


deg D=N 


. Let D be an effective divisor of K. Define d,,(D) to be the number 
of m+ 1-tuples of effective divisors (C1, C2,...,Cm,Cm+1) such that 
mtr C; = D. Note that dj(D) = d(D) is equal to the number of 
divisors of D. Show that ¢4,,(s) = ¢x(s)™t' and use Theorem 17.4 
to derive an asymptotic formula for Am(N) = lace p=n Gm(D). 


. Prove Theorems 17.14 and 17.15. 


. In the situations of Theorem 17.14 and 17.15 find a formula for 


g™@ So L(A/2,xXm) - 
deg mM 


Appendix 


A Proof of the Function Field 
Riemann Hypothesis 


In this Appendix we will give a detailed exposition of E. Bombieri’s proof 
of the Riemann Hypothesis for function fields over finite fields or, in other 
language, for curves over finite fields. For the statement, see Theorem 5.10 
of Chapter 5 or Theorem A7 below. 

For hyperelliptic curves this result was first conjectured, in the 1920s, 
by E. Artin. In the 1930s, H. Hasse made the first substantial contribution 
by proving it in the case of function fields of genus one (the case of elliptic 
curves). In the late 1940s, A. Weil found a way to prove the general result. 
In fact, he gave two proofs; one involved intersection theory on algebraic 
surfaces, the second involved l-adic representations and abelian varieties. 
Both proofs used sophisticated algebraic geometry. In fact Weil had to 
redo the foundations of algebraic geometry to provide the necessary back- 
ground for his proofs. It was a surprise then when S.A. Stepanov, in the late 
1960s, found a proof, albeit in special cases, which involved nothing deeper 
than the Riemann-Roch theorem. Soon thereafter, W. Schmidt was able 
ot use Stepanov’s ideas to prove the general result. Finally, E. Bombieri 
found a substantial simplification of the proof of Stepanov and Schmidt, 
see Bombieri [1], both for his original treatment and for references to this 
history. 

Although the proof we are about to give is extremely ingenious and 
“elementary” it has to be admitted that Weil’s original method, espe- 
cially the approach involving algebraic surfaces, is much more natural. 
However, this intersection-theoretic proof requires extensive background 
whereas Bombieri’s proof uses nothing more than material developed in 


this book. 
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It is possible to give Bombieri’s proof purely in the context of function 
fields without mentioning algebraic curves. The resulting treatment is log- 
ically coherent, but feels very artificial. As a compromise, we will assume 
that the reader is familiar with the beginnings of the theory of algebraic 
curves as is presented, for example, in Fulton [1], and sketch the connection 
between the algebraic-geometric language and the language we have used 
in this book. Having done that, we will switch back and forth as convenient. 


Let C' be a complete, non-singular algebraic curve defined over a finite 
field F. We assume that C is embedded in projective N-space, P’ (IF), where 
F denotes a fixed algebraic closure of F. Let K = F(C) be the function 
field of C' over F. Recall that a typical element of f € K is represented 
by a quotient F’/G where F and G are homogeneous polynomials of the 
same degree in the ring F[Xo, Xj,..., Xj], and where G does not vanish 
identically on C’. In this circumstance, G only vanishes at finitely many 
points of C’, and f defines an actual function on the complement of this set 
to F. K is an algebraic function field in one variable over F and one can 
show F is algebraically closed in K. Now, let K = F(C). Then K = KF = 
U,, KFn =U, Kn. 

There is a one to one correspondence between points on C(F) and primes 
in K. If a = [ao,a1,...,an] € C(F), let Og be the set of elements f € K 
represented by F'/G where G(a) 4 0 (F and G are homogeneous polyno- 
mials with coefficients in F). This is easily seen to be a ring. Let Py C Oo 
be the set of f € Og such that f(a) = 0. Then, Oy, is a discrete valuation 
ring and P, is its maximal ideal. The fact that O, is a dvr follows from 
the assumption that every point on C, in particular a, is non-singular. One 
can show that the map a — (Ojg,?P.,) is one to one and onto map from 


C(I) to the primes of the function field K. 

There is also a natural map from C(F) to the primes of K. If a € C(F), 
let A, be the set of elements in f € K which are represented by F'/G with 
G(a) 4 0 where the coefficients of both F and G are in F. Let Py be the 
set of elements f € Ry such that f(a) = 0. Then, R, is a dvr and Py is 
its maximal ideal. It is useful to remark that the residue class field, Ro, / Pa 
is generated over F by adjoining the ratios of the coordinates of a. We call 
that field F(a) and note that deg, P,, = [F(a) : F). 


The map a + (Ry, P.) from C(F) to the set of primes of K is onto, 


but it is not one to one. In fact, we have P, = Py if and only if ca = a’ 
for some 0 € Gal(F/F). An automorphism o operates on a point a = 
[a9,Q1,...,aN] by ca = [oap, 7Q1,..., 7a]. It follows from this that the 


number of points in C(F) which correspond to a given prime P of K is 
equal to degy P. An important special case of this remark is that there is 
a one to one correspondence between rational points on C, i.e. C(I), and 
primes P of K of degree 1. The number of primes of degree 1 of K was 
denoted by Ni (4) in Chapter 8. See Proposition 8.18 and also Proposition 
5.12. 

We define a rational map ¢: P% (IF) + P4 (IF) by 
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b([x0,21,...,2N]) = (v6, 27,..., 0%] . 


This is called the Frobenius morphism. It has the important property that 
an element is fixed under ¢ if and only if it is in P (IF). More generally, 
an element is fixed under ¢” if and only if it is in PY (F,). Since C is 
defined over F, C(IF) is defined by the vanishing of a set of homogeneous 
polynomials with coefficients in F. It follows easily that ¢ maps C(IF) to 
itself and that the fixed points of this action, C?, is equal to C(F). More 
generally, C? = C(F,,). 

As we saw in Chapter 8, the Galois group of K/K is isomorphic to the 
Galois group of F/F. The latter group is generated (topologically) by 7, 
the automorphism that takes y € F' to y%. We use the same letter 7 to 
denote the corresponding automorphism of K. We think of 7 as acting on 
the coefficients of functions. Note that ¢ and a have the same action on 


points of C(I). Using this and the definitions, we find that for a € C(F) we 
have mPa = P¢q). Thus, a prime P of K corresponds to a rational point 
if and only if 7P = P. 

Our first goal is to establish, under some mild restrictions, an upper 


bound for Ni (i). We will show 


Theorem Al. Let g be the genus of C and suppose (g +1)* <q and that 
q 1s an even power of the characteristic p. Then, 


Ni(K)<q+14+(2g+1) vq. 


Before getting to the proof of Theorem Al, we will need a number of 
preliminary results. 

We may assume that C(F) is non-empty since otherwise the Theorem 
is vacuous. Let o be a rational point, i.e. an element of C(F), and P, 
the corresponding prime of K. For each positive integer m define Ry, = 
L(mP.) = {f € K | (f) +mP, = 0}. Rm is a finite dimensional vector 
space over F and we know a lot about this dimension via the Riemann-Roch 
theorem, Theorem 5.4. 


Proposition A2. 
1 dim Ry, < dim R,, +1. 
dim Ry, > m—g+1 with equality ifm > 2g — 2. 


) 
2) dimRy<m-+i. 
) 
4) 


Rm °® © Ring 


5) f € Rm» implies fo ¢ is a g-th power and (fod) = qn '(f). 
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6) dim R?, =dimR, for e>0. 
7) dimR,,ocg=dimR,,. 


Proof. To prove 1, note that if f and g both have a pole of order m+ 1 
at P, then f/g has order 0 at P, and thus is congruent to a constant + 
modulo P,. It follows that f — yg = g(f/g —7) has a pole of order at most 
m, i.e. f —yg € Rm. 

Since Rp consists precisely of the constants, it has dimension 1. Assertion 
2 now follows from 1 and induction. 

Assertion 3 is simply Riemann’s Theorem, Theorem 6.6, together with 
degz Po = 1. 

We can deal with 4 and 5 simultaneously. Let f be represented by the 
quotient of two homogeneous polynomials with coefficients in F. Set \ = 


nm! € Gal(IF/F). We find 


_ F(o(@)) _ AF(a)? _ q 

Thus, fo ¢ = Af? and consequently (f o ¢) = gA(f) which proves 5. Also, 
if f € Rm, then so is Af since ordp, (Af) = ordzp,(f) = ordp,(f). Thus, 
fog=Af? € Rmq which proves 4. 

The map f > f? is a quasi-linear isomorphism of R,, with RP. which 
proves 6. 

Finally, to show 7 it is enough to check that f — f o¢ is one to one. If 
fop~= go, then Af? = Ag? so Af = Ag which implies f = g (apply 7 to 
both sides). 


If A is a subspace of R,, and B is a subspace of R, we denote by AB 
the subspace of R,,4., generated by all the products fg where f € A and 
gEB. 


Proposition A3. If ln? < q, then the natural homomorphism from 
Ri S% Rmod to RP (Rm o ¢) is an isomorphism. 


Proof. By Proposition A2, part 1, we see that R,, has a basis {f1, fo,..., 
fr} such that ordf; < ordf,,; for i = 1,2,...,t 1. Any element of the 
tensor product can be written in the form 


t 
Sig? @fiod, 


i=1 


where the g; are elements of R,. If such an element is in the kernel of the 
natural homomorphism, we would have a relation of the form 


Sg? (fi0¢) =0. 
i=l 
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We will show this can’t happen unless g; = 0 for alli = 1,2,...,-—1 and 
that will establish the Proposition. 
Suppose some g; # 0 and let r be the smallest such index. Then, 


t 
g? (frod)=— > gP (fiod). 
w=rt+l 
Taking the order of both sides at P, and using Proposition A2, part 5, we 
see 


p°ord g, + q ord f, > min (p%ord g; + q ord f;) > —lp* +q ord f+, . 
wr 


Thus, 
poord g, > —lp* + q(ord f-41 —ord fr) >¢q—Ip® >0. 


It follows that g, has a zero at P,. Since g, € R; it has no poles away 
from P, and a zero at P,. It follows that g, = 0. This contradicts our 
assumption, and so, completes the proof. 


Corollary. If lp < q, then dim RP (Rm oo) = (dim R;)(dim R,,). 


Proof. This follows directly from the Proposition and from Proposition 
A2, part (7). 


We have now completed the preliminaries. 


Proof of Theorem Al. The idea is to produce a function with a high 
order zero at each rational point and a small number of poles. We will see 
how this works as we go along. We continue to assume that lp® <q. 

We begin by defining a F-linear homomorphism 6 from R? (Rm 0° ¢) to 
Re ° Rm. Using the notation established in the proof of Proposition A3, this 
is given by 


5(S- 9? (fi0d)) = Soa? hi. 
1=1 1=1 


That 6 is well defined follows immediately from Proposition A3. The dimen- 
sion of the domain of 6 is greater than (!—g+1)(m—g+1) by Proposition 
A2, parts 6 and 7 and Riemann’s theorem, part 3. Assume that 1,m > g. 
Then, the image of 6 is contained in L((lp®° + m)P.) which has dimension 
lp’ +m—g+1, again using Proposition A2, part 3. Thus, 


dimker 6 > (1—g+1)(m—g+1) — (lp? +m—g+1). 
If the quantity on the right is positive, the kernel is not empty. Assume 


e 


this and let f = >, gf (f;0 ¢) be a non-zero element of ker 6. Ifo Aa € 
C(I), we calculate 


f(a) = Dd 9(2)” f(b(o)) = d9:(o)” fila) =0. 
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So, f must vanish at every point of C(I) except o. Moreover, by Propo- 
sition A2, part 5, and the fact that p° < q by hypothesis, we see from the 
expression for f as a sum that f is a p®-th power. Thus, 


p(Ni(K) —1) < degg(f)o = degg (foo < lp* + mq. 


We have used the fact that Rp,o¢ C Rmg by Proposition A2, part 4. This 
inequality yields 
Ni(K) <14+l+maqp< . 


Our proof of this inequality is subject to the conditions 


a) Ip’ <q. 
(b) Im>qg. 
(c) (l-~g+1)(m—g+4+1)>lp*+m— gl. 


We proceed to make suitable choices for /, m, and e so that these three 
conditions are fulfilled and makes the above inequality into the one asserted 
in the statement of the Theorem. 

We are assuming that q is an even power of p, so set q = p”° and choose 
e = b. Set m = p® + 2g. We now want to choose | so that condition (c) 
holds. Simplifying that inequality slightly, we need 


(l—g)(m+1—g) >lp 


or 
(l1—g)(p? +g +1) > lp” 
or q 
l> psa. 
gti? “9 


Let’s choose | = [gp’/(g+1)]+g+1 ([r] denotes the greatest integer less 
than or equal to r). With these choices for | and m, conditions b and c are 
fulfilled. 

Assuming (g+1)* < q we will now show that condition a is also fulfilled. 
Note that (g + 1)* < q implies (g + 1)? < p® which yields gp? + (g +1)? < 
(g+1)p°. Thus, 


Gb b 
rs Ll< e 1 
gti? +g p (1) 


This inequality implies | < p’, so that Ip® < p?° = q which is condition c. 
Let’s substitute our choices for |, m, and e into the inequality N,(K) < 
1+1-+mgp~*. Since, by Equation 1 we have | < p° we find 


Ni(K) <p? +14 (p? + 29)p? =q4+14+ (2941). 


This completes the proof of Theorem Al. 
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Having produced a good upper bound for N; (4), we now take up the task 
of producing a suitable lower bound. The method will involve consideration 
of Galois extensions of K. 

Let L/K denote a finite, geometric, Galois extension of K with Galois 
group G. Let L = FL and K = FK. Since we have assumed L/K is 
geometric, it follows that Gal(L/K) ~ Gal(L/K) = G. We will simply 
identify G with the Galois group of L/K. The Frobenius element 7 of 
L/L maps to the Frobenius element of K/K by restriction. We will denote 
both of these automorphisms by 7. Note that as automorphisms of L/K, 
m= commutes with G (use the fact that L is the composite of L and K). 

Let T denote the set of primes in K which lie above rational primes 
in K, i.e. those primes of K whose degree is equal to 1. As we have seen 
|| = Ni (XK). Also, we showed earlier that the primes in T are characterized 
by the condition 7P = P. Let T denote the primes in L lying above those 
in T’. If P € T then the set of primes above P are acted upon transitively by 
G. Also, 7 maps this set into itself since 7P = P. Thus, if $B € T there is a 
o € Gsuch that 7B = of. Moreover, the element a is uniquely determined 
if %/P is unramified. Let T’ C T be the set of unramified primes in T’. We 
have defined a map 


n:T' 9G. 


Definition. With the above notations, let T(c) d enote the set of unrami- 
fied primes 8 in T’ such that (98) = o. Let N(L/K,o) denote the number 
of elements in T’(c). 


A few observations will be useful. For each prime in T which is unram- 
ified in L, there are |G| primes above it in T. This follows from from the 
fundamental relation efg = |G| (see Proposition 9.3) since f = 1 because 
we a working over an algebraically closed constant field F. Thus, 


P| = |G|Ni(K) + O(1) . 


The error term depends on the number of ramified primes in L/K but 
is independent of g. We will later vary the constant field, i.e consider the 
fields K, = F,,K, and so the O(1) term will not matter much. 

Since T’ = = Use T(o), disjoint union, we find 


S> N(o = |G|Ni(K) + O(1) . (2) 


cEG 


We will need this relation shortly. 

For those readers who prefer more geometric language, L corresponds 
to a curve C’ defined over F and covering C, i.e. there is an epimorphism 
wy :C — C. The set T corresponds to the set of rational points C(F) on 
C and the set T’ corresponds to ~!(C(F). The fibers above the points in 
C'(F) are mapped into themselves by both ¢ (or 7) and G, etc. The whole 
argument can be given in either language. 
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Proposition A4. With the above notations and definitions let g be the 
genus of L (which is the same as that of L) and let o be an element of G. 
Suppose (g +1)* <q and that q is an even power of p. Then, 


Ni(o,L/K) <q+14+(29+1)V/4 . 


Proof. The proof is almost identical with the proof of Theorem Al. One 
supposes there is at least on rational prime $B, in LD and defines the vector 
spaces R, = L(nB,). In the proof of that Theorem we begin with defining 
a homomorphism 6 from R? ; (Rmodg) > RP "Rim. We modify that to get a 
map 6, from R? (Rm o¢) + R? (Rm oc) as follows 


Just as before, one invokes Proposition A3 to insure this map is well defined. 
The rest of the proof goes exactly as before with the one exception. After 
assuming | > g and m > g one has to show that the image of 6, has 
dimension less than or equal to lp° + m—g +1. 

If f € Rm, then foo € L(mo',). It follows that the image of 6, is 
contained in L(lp°$8, + moa~!,) which has dimension lp® + m — g +1 by 
Riemann-Roch. 

We leave it to the reader to check the remaining details. 


Proposition A5. With the same notations as above, for allo € G, 


q+1+(Ni(K) —q-1)|G| + O(/4) < Mo, L/K) . 


Proof. By Proposition A4, we have for each o € G, 
0<[g+1+(26+1)/¢- Nilo, L/K)). 
Sum over o and one finds 


0< SOL] < (+14 2941) VG -|GINi(K) + 0(1). 


oEG 


We have used Equation 2. Since each term in brackets is positive, we deduce 
g+14(29+1) V9-Ni(o, L/R) < (g+1+(29+1)@)IG|-|G|N(K)+0(1). 
From this inequality, the Proposition follows immediately. 


We are aiming to prove that Ni(K) = q+ O(,/q). Theorem A1 assures 
us that with mild restrictions N,(K) < q+ O(,/q) so we must derive the 
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inequality in the other direction. Proposition A5 allows us to do this quickly 
in a special case. Suppose we can find an element x € K such that K/F(z) 
is a finite, geometric, Galois extension. Let’s apply Proposition A5 to the 
pair of field K and F(z) instead of L and K. Since the number of rational 
primes in F(x) is exactly q¢ +1, we find g + Va) < Ni(o,K/F(x)) for 
each o € Gal(K /F(z). Summing over o and using Equation 2 again we find 
[K : F(x)]q + O(,/@) < [K : F(x)|Ni(K). Cancel [K : F(x)] and we have 
our proof that Ni(K) = q+ O(/@). 

In general, we cannot find an element x € K with all these nice proper- 
ties. However, since F is a perfect field, one can find an element x € K such 
that K/F(x) is separable (see Lang [4], Chapter VIII, Proposition 4.1). Let 
L be the Galois closure of K/F(z), i.e., the smallest algebraic extension 
of K that is Galois over F(x). It can happen that the constant field, E, 
of L is larger than F. If so replace F(x) by E(x) and K by EK. Then, all 
three extensions are geometric and L/E(z) is Galois. So, at the expense of 
making a small constant field extension, we can assume F is the constant 
field of L to begin with. We shall see that small constant field extensions 
will not affect the overall proof. 


Theorem A6. Let K/F be a function field of genus g over a finite field 
F with q elements. Suppose q is an even power of p. Suppose further that 
there is an element x € K such that the Galois closure, L, of K/F(x) 
is a geometric extension of F(x). Finally, assume (g + 1)* < q. Then, 


Ni(K) = q+ O(,/9). 


Proof. By Theorem Al, Ni(K) < ¢+ O(/@), so it remains to prove the 
opposite inequality. 

Let G = Gal(L/F(x)) and H = Gal(L/K). Let o € G. Applying Propo- 
sition A5 to the extension L/F(x) we find 


q+ O(/q) < Ni(o, L/E(2)) . 


Sum these inequalities over all elements in 7 € H. We obtain 


H|q+ O(a) < 4) My (7, L/F(a)) . 
TECH 
We will show in a moment that if 7 €¢ H, Ny(7, L/F(x)) = Ni(7, L/K). 


Assuming this is correct, the sum in the last inequality is 


S| Ni(7, L/F(2)) = S> Nir, = |H|N,(K) + O(1) , 


TECH TECH 


using Equation 2 one more time. 

Putting the last two relations together, we have q + O(,/q) < Ni(K) 
which is the result we are looking for. 

It remains to prove Nj(r, L/F(x)) = Ni(7,L/K) if r € H. Let B be a 
prime of L lying over a rational prime P of F(z). If mB = 7P for 7 € H, 
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we have to show that $B lies over a rational prime of K. Let p lie below % 
in K. Then, 78 = 78 implies mp = tp = p. However, we have seen that 
the relation mp = p characterizes the rational primes of kK’. This completes 
the proof. 


Our final task is to show that Theorem A6 is equivalent to the Riemann 
Hypothesis for function fields over finite fields. This is relatively simple. 


Theorem A.7 (the Riemann Hypothesis for Function Fields) Let K/F be 
a function field with finite constant field, F, having q elements. Let CxK(s) 
be the zeta function of K. All the zeros of ¢x(s) lie on the line R(s) = 5. 


Proof. If we make the substitution u = q~* we have 


Lx(u) 


SOS Ta) 


where 
29 


Lx(u) = Ila — mu) . 


Here, g denotes the genus of K. As we have pointed out in many places, the 


assertion that all the zeros of ¢x(s) lie on the line #(s) = § is equivalent 


to the assertion that all the inverse roots, 7;, of Lx (u) have absolute value 
JG: 

We first remark that to prove the Theorem, it suffices to prove it for any 
constant field extension, K, = F,K, of K. This follows from Proposition 
8.16 which asserts that 


Lx, (u) = [ja — 7; U) . 


Thus, if the Riemann Hypothesis is true for K,;, we would have |r| = q? for 
alli = 1,...,2g which implies, obviously, that |7;| = g2 for alli =1,...,2g. 
This is the Riemann Hypothesis for K. 

Let’s choose n so large that (g +1)* < q”. If q” is not an even power 
of p, replace n by 2n. Next, as we have seen, we may, by taking a finite 
extension F,,, of Fo,, assume there is an x € K,, such that K,,/Fm(z) 
is separable and that the Galois closure L of K,,/F (x) is a geometric 
extension of F,,(xz). Thus, we have shown that we can find an m > 1 so 
that all the conditions of Theorem A6 are fulfilled for K,,. By the last 
paragraph, to prove the Riemann Hypothesis for K we may as well assume 
all these properties hold already for K/F. If these conditions hold for K/F, 
they hold in any constant field extension. Theorem A6 then implies that 
N,(Kn) = q* + O(q?) for all n > 1. 

We recall some facts proved in the text, namely at the end of Chapters 
5 and 8. First, Ni(K,) = N,(K) where N,,(K) is defined by N,,(K) = 
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dvajn 4aa(K). The number aq(K) denotes the number of prime divisors of 
K of K-degree equal to d. Moreover, 


ZK (u = C€Xp (yo CO) 


Taking the logarithmic derivative of Zx(w) and using the above identity 
yields 


Now write Zz (u) = “9 (1 —m,u)/(1—u)(1—qu) and calculate uZ,(u)/ 
ZK (u) using this formula. We find 


(uw) CoO 
= La nm Tr nm 
(u) +1—m} — my —+++— 1g,)u . 


Combining these formulas produces the following identity. 


So (Wi (Kn) = a = Yul = = > Y(m)" 


Since Ni(K,) = g”™+O(q2), the sum on the left has radius of convergence 
at least gq~ 2. The radius of convergence of the sum on the right is exactly 
the minimum over i of the quantities |7;|~!. Thus, |m;| <q? for all i. 
We know that 7; — q/7; is a permutation of the set of inverse roots of 
Lx (u) (see the remarks following Theorem 5.9 where this fact is shown to 
equivalent to the functional equation for Cx(s)). It follows that |7;| = q? 


for 1 = 1,2,...,2g. This is what we wanted to prove!! 
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